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USE OF HEPARANASE TO IDENTIFY AND ISOLATE ANTI-HEPARANASE 

COMPOUND 
FIELD OF INVENTION 
The present invention discloses the use of mammalian heparanase, preferably 
5 recombinant heparan ase, for screening for anti-heparanase compounds. More particularly, the 
present invention provides a method of selecting IHA (Inhibitors of Heparanase Activity). In 
addition, Che present invention provides a purified heparanase, particularly suitable for use to 
identify and isolate anti-heparanase compounds as well as for other known uses of heparanases, 
such as its use to accelerate wound healing or its use as an immobilized heparanase filter 
10 connected to extracorporeal devices to degrade heparin and neutralize its anticoagulant properties 
during surgery. 

BACKGROUND OF THE INVENTION 
Elevated heparanase activity has been documented in mobile, invasive cells. Examples 
in?lM4ffi involve melanoma, lymphoma, mastocytoma, mammary adenocarcinoma, leuksmia, 
15 and rheumatoid fibroblasts. This activity has also been documented in non-pathologic 

situations involving the migration of lymphocytes, neutrophils, macrophages, eosinphils and 
platelets. An inhibitor of heparanase would therefore broadly influence the invasive potential of 
these diverse cells. 

Inhibition of heparan sulfate degradation would also inhibit the release of bound growth 
20 factors and other biologic response modifiers that would, if released, fuel the growth of adjacent 
tissues and provide a supportive environment for cell growth (Rapraeger, et al.. Science 252; 
1705-1708, 1991). Inhibitors of heparanase activity would be of value in the treatment of 
arthritis, vascular restenosis, tumor growth and progression, and fibro-proliferative disorders. 

Until now, the obstacles to designing a screening assay to find inhibitors of mammalian 
25 heparanase have been the unavailability of a mammalian heparanase that is purified to apparent 
homogeneity and the lack of information about the amino acid sequence or the 3-dimensional 
structure of the enzyme. Without the amino acid sequence, it has not been possible to produce 
recombinant mammalian heparanase to be used in large volume screening efforts. Knowledge 
of the tertiary and quaternary structures would facilitate rational design of IHA. This report 
30 overcomes obstacles relating to the sequence of the heparanase, and also provides a model for 
higher-order structure. 

Heparanase refers to a mammalian enzyme which can degrade heparin proteoglycans 
(HPG) and/or heparan sulfate proteoglycans (HSPG). 

Heparanase activity in mammalian cells is well known. It is found in various melanoma 
35 cells (Nakajima, et al. t Cancer Letters 31: 277-283, 1986), mammary adenocarcinoma cells 
(Parish, et al„ Int. J. Cancer, 40: 51 1-518, 1987), leukemic cells (Yahalom, et al, Leukemia 
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Research 12: 711-717, 1988), mast cells (Ogren and Lindahl, /. Biol Chenu 250: 2690-2697, 
1975), macrophages (Savion, et al, J. Cell Physiol, 130: 85-92, 1987), mononuclear cells 
(Sewell, et al., Biochem. J. 264: 777-783, 1989), neutrophils (Matzner, et al, J. Leukocyte 
Biology 51: 519-524, 1992), T-cells (Vettel, etal, Eur. J. Immunol 21: 2247-2251, 1991), 
5 platelets (Haimovitz-Friedman, et al t Blood 78: 789-796, 1991), endothelial cells (Godder, et 
al, J. Cell Physiol 148: 274-280, 1991), and placenta (Klein and von Figura, BBRC 73: 
569, 1976). 

WO 91/02977, incorporated herein by reference, describes a substantially, but partially, 
purified heparanase produced by cation exchange resin chromatography and the affinity 
10 absorbent purification of heparanase-containing cell extract. WO 91/02977 also describes a 
method promoting wound healing utilizing compositions comprising a "purified" form of 
heparanase. 

Others have proposed the use of purified bacterial heparanase, immobilized onto filters 
and connected to extracorporeal devices, to degrade heparin and neutralize its anticoagulant 
15 properties post surgery (Freed, et al, Ann. Biomed. Eng. 21: 67-76, 1993). 

U.S. Patent 4,882,318 describes heparanase-inhibiting compositions for preventing tumor 
metastasis. 

Haimovitz-Friedman et al {Blood 78: 789-796, 1991) describe an assay for heparanase 
activity that involves the culturing of endothelial cells in radiolabeled 35 S0 4 to produce 
20 radiolabeled heparan sulfate proteoglycans, the removal of the cells which leaves the deposited 
extracellular matrix that contains the 35 S-HSPG, the addition of potential sources of heparanase 
activity, and the detection of possible activity by passing the supernatant from the radiolabeled 
extracellular matrix over a gel filtration column and monitoring for changes of the size of the 
radiolabeled material that would indicate that HSPG degradation had taken place. This assay 
25 does not have the capability for large-scale screening of inhibitors. 

Nakajima et al {Anal Biochem. 196: 162-171, 1986) describe a solid-phase substrate 
for the assay of melanoma heparanase activity. Heparan sulfate from bovine lung is chemically 
radiolabeled by reacting it with [ 14 C]-acetic anhydride. Free amino groups of the [ ,4 C]-heparan 
sulfate were acetylated and the reducing termini were aminated. The [ 14 C]-heparan sulfate was 
30 chemically coupled to an agarose support via the introduced amine groups on the reducing 
termini. This substrate is limited in that it is an extensively chemically modified form of 
naturally occurring heparan sulfate. 

Khan and Newman {Anal Biochem. 196: 373-376, 1991) describe an indirect assay for 
heparanase activity. In this assay, heparin is quantitated by its ability to interfere with the color 
35 development between a protein and the dye Coomassie brilliant blue. Heparanase activity is 
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detected by the loss of this interference. This assay is limited in use for screening because it is 
so indirect that other non-heparin compounds could also interfere with the protein-dye reaction. 

The CXC chemokine family (also called the intercrine a family) is one branch of the 
supergene "intercrine" cytokine family (Oppenheim, Ann. Rev. Bio -im. 9: 617-648, 1991). 
5 It's members include platelet factor 4, platelet basic protein and derivatives, 7IP-IO, gro(a,P,Y), 
NAP-l/interieukin-8, mig, and ENA-78 (for review, see Miller and Krangel, Critical Reviews in 
Immunology 12: 17-46, 1992). The other branch, the CC chemokines or intercrine-P family, 
includes MlPla, MIPlpJE/MCP-l, RANTES, and MCAF. All members of both branches of 
this chemokine family characteristically are basic heparin-binding polypeptides, display 

10 molecular weights between 8 and 1 1 kD, share 20 - 50% homology, and function broadly in 
pathologic situ v ions characterized by inflammation and tissue remodeling. 

The proieolytically processed forms of platelet basic protein include CTAP-m, p- 
thromboglobulin, and NAP-2. P-thromboglobulin (Moore, et al. t Biochim. Biophys. Acta. 379: 
360-369, 1975) and CTAP-m (Castor, et al. % Arthritis Rheum. 20: 859-868, 1977), were 

15 originally isolated from activated supernatants or lysates from outdated plti . sts. Using the 
techniques of subcellular fractionation and radioimmunoassay, P-thrombogiobulin was identified 
as an a-granule protein that could be released upon activation (Kaplan, et al. 9 Blood 53: 604- 
618, 1979). Platelet basic protein itself was later isolated from fresh platelets, megakaryocytes, 
and HEL cells, an immortal human erythroleukemia cell line (Holt, et a/.. Biochemistry 25: 

20 1988-1996, 1986; Holt, etaU Exp. Hematol. 16: 302-306, 1988). Walz and Baggiolini 
isolated the processed form of NAP-2 from platelet-containing cultures of stimulated 
mononuclear cells (Walz, et ah, J. Exp. Med. 170: 1745-1750, 1989). 

Material labeled as P-thromboglobulin is commercially available from Calbiochem, San 
Diego, CA (Cat. # 605165), Celsus Laboratories, Cincinnati, OH (Cat # 41705), and 

25 Haematologic Technologies, Essex Jet, VT (Cat. # HBTG-02100. The inventors have 

determined, by using the "Purification Assay," that the commercial preparation have heparanase 
activity at a level of 0.075 units/ug. This activity is below the level of 1 unit/ug needed for the 
screening of anti-heparanase compounds in accordance with the assay of the subject invention. 
U.S. Patent 4,897,348 describes recombinant materials and methods for producing 

30 human connective tissue-activating peptide-DI (CTAP-m) and analogs thereof. 

Transglutaminases catalyze the posttranslau'onal modification of proteins by 
transamidation of available glutamine residues. This action results primarily in the formation of 
epsilon-(gamma-glutamyl)lysine cross-links (Greenberg, et aL, FASEB J. 5: 3071-3076, 1991). 
This posttranslational modification has been reported to dramatically alter the action of some 

35 small proteins.. For example, a transglutaminase produces a glutamine-lysine cross-link in the 13 



WO 95/04158 PCT/US94/08207 

-4- 

kD phospholipase Aj and increases its specific enzymatic activity (Cordella-Miele. et al, J. 
Biol. Chem. 265: 17180-17188, 1990). A transglutaminase cross-links another small molecule, 
interleukin-2, and converts its activity to one that is cytotoxic to mature oligodendrocytes (Eitan 
and Schwartz. Science 261: 106-108,1993). The. glutamine-lysine cross-link in a protein would 
5 result in the loss of overall positive charge for that protein. The transglutaminases are optimally 
active and generally used under reducing conditions such as dithiothreitol. The concept that 
glutamine-lysine cross-Unking alters the activity of these small proteins may be applicable to 
other small molecules as well. 

SUMMARY OF TH E INVENTION 
10 The present invention discloses a method of screening for compounds having anti- 

heparanase activity (AHA compounds), i.e. inhibitors of heparanase activity (IHA). comprising 
the steps of: contacting a potential AHA compound with radiolabeled heparin/heparan sulfate 
and heparanase for a time and under such conditions sufficient to allow for inhibition of 
heparanase activity; detecting inhibition of heparanase activity, and selecting compounds that 
15 inhibit heparanase activity. The present invention also discloses the amino acid sequence 
identity of the heparanase that has been purified to homogeneity by chromatography under 
reducing conditions. Identification of the amino acid sequence of the protein which contains 
heparanase activity is crucial for the production of recombinant mammalian heparanase. 

DETAILED DESCRIPTION OF THE IN VENTION 
20 The present invention provides a purified heparanase. and a method for producing it. 

The heparanase so produced has an activity of greater than 20 units/ pg protein, preferably 
greater than 50 units heparanase activity per pg protein (1 unit - 1% cpm < 30 K/hr using the 
"Purification Assay" (Example 2, Part D). 

In addition, the present invention provides recombinant heparanase and a means for 
25 producing it. The term "purified heparanase" as used in the specification and claims includes 
the recombinant heparanase as described in the subject application. The recombinant heparanase 
of the subject invention can be used for the same purposes and in the same manner as the 
purified heparanase. 

The purified heparanase of the present invention has an isoelectric point of less than 5.5 
30 (preferably about 4.8 - 5.1) and preferably is activated by treatment with transglutaminase using 
reducing conditions. 

The recombinant heparanase of the present invention has an isoelectric point of less than 
5.5 (preferably about 4.8 - 5.1), and is isolated under reducing conditions and is activated by 
treatment with transglutaminase. 
35 Suitable transglutaminases that may be used for this purpose include Activated Factor 

Xma, guinea pig liver transglutaminase, epidermal transglutaminase, keratinocyte 
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transglutaminase, and tissue transglutaminase. 

The heparanase of the present invention has the amino acid sequence (SEQ. ID. NO: 1) 

of: 

Asn Leu Ala Lys Gly Lys Glu Giu Ser Lea Asp Ser Asp LeuTyr Ala 
5 5 10 15 

Glu Leu Arg Cys Met Cys He Lys Thr Thr Ser Gly He His Pro Lys 
20 25 30 

10 Asn lie Gin Ser Leu Glu Val lie Gly Lys Gly Thr His Cys Asn Gin 
35 40 45 



15 



Val Glu Val De Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp 
50 55 60 

Pro Asp Ala Pro Arg De Lys Lys De Val Gin Lys Lys Leu Ala Gly 
65 70 75 80 



Asp Glu Ser Ala Asp 
20 85 

encoded by the cDNA sequence (SEQ ID NO: 2) of: 

1 AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA 
51 ACTCCGCTGC ATGTCTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA 
101 TCCAAAGTTT GGAAGTGATC GGGAAAGGAA CCCATTGCAA CCAAGTCGAA 
25 151 GTGATAGCCA CACTGAAGGA TGGGAGGAAA ATCTGCCTGG ACCCAGATGC 
201 TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT GATGAATCTG 
251 CTGAT 

which corresponds to the cDNA sequence and derived amino acid sequence of CTAP-m. See 
Wenger et al., Blood, 73: 1498-1503, 1989. 
30 In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 3) of: 

Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 
5 10 15 

35 Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys De 
20 25 30 

Lvs Thr Thr Ser Gly De His Pro Lys Asn Ue Gin Ser Leu Glu Val 
35 40 45 

40 

De Gly Lys Gly Thr His Cys Asn Gin Val Glu Val De Ala Thr Leu 
50 * 55 60 

Lys Asp Gly Arg Lys De Cys Leu Asp Pro Asp Ala Pro Arg De Lys 
45 65 70 75 80 
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Lys lie Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp 
85 90 

encoded by the cDNA sequence (SEQ ID NO: 4) of: 

1 TCCTCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA 
5 51 AAGTCTAGAC AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA 

101 CAACCTCTGG AATTCATCCC AAAAACATCC AAAGTTTGGA AGTGATCGGG 

151 AAAGGAACCC ATTGCAACCA AGTCGAAGTG ATAGCCACAC TGAAGGATGG 

201 GAOGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG AAAATTGTAC 

251 AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 
10 which corresponds to the cDNA sequence and derived amino acid sequence of platelet basic 

protein. See Wenger et al.. Blood, 73: 1498-1503, 1989 as well as Walz and Baggiolini, BBRC 

159: 969-981, 1989; Castor, et al, BBRC 163: 1071-1078,1989. 

In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 5) of: 
15 Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys 

1 5 10 15 



20 



Met Cys He Lys Thr Thr Ser Gly He His Pro Lys Asn He Gin Ser 
20 25 30 

Leu Glu Val lie Gly Lys Gly Thr His Cys Asn Gin Val Glu Val lie 
35 40 45 



Ala Thr Leu Lys Asp Gly Arg Lys De Cys Leu Asp Pro Asp Ala Pro 
25 50 55 60 

Arg ne Lys Lys De Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala 
65 70 75 80 

30 Asp 

encoded by the cDNA sequence (SEQ ID NO: 6) of: 

1 GGCAAAGAGG AAAGTCTAGA CAGTGACTTG TATGCTGAAC TCCGCTGCAT 
51 GTGTATAAAG ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTTGG 
101 AAGTGATCGG GAAAGGAACC CATTGCAACC AAGTCGAAGT GATAGCCACA 

35 151 CTGAAGGATG GGAGGAAAAT CTGCCTGGAC CCAGATGCTC CCAGAATCAA 
201 GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT GAT 
which corresponds to the cDNA sequence and derived amino acid sequence p^thromboglobulin 
(pTG). See Wenger et al.. Blood, 73: 1498-1503, 1989. 

In another aspect, the present invention provides a heparanase having the amino acid 

40 sequence (SEQ ID NO: 7) of: 
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Glu Leu Arg Cys Met Cys lie Lys Thr Thr Ser Gly He His Pro Lys 
1 5 10 15 

Asn He Gin Ser Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin 
5 20 25 30 

Val Glu Val De Ala Thr Leu Lys Asp Gly Arg Lys lie Cys Leu Asp 
35 40 45 

10 Pro Asp Ala Pro Arg He Lys Lys De Val Gin Lys Lys Leu Ala Gly 
50 55 60 

Asp Glu Ser Ala Asp 
65 

15 

encoded by the cDNA sequence (SEQ ID NO: 8) of: 

1 GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA 
51 CATCCAAAGT TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG 
101 AAGTGATAGC CACACTGAAG GATGGGAGGA AAATCTGCCT GGACCCAGAT 
20 151 GCTCCCAGAA TCAAGAAAAT TGTACAGAAA AAATTGGCAG GTGATGAATC 
201 TGCTGAT 

which corresponds to the cDN A sequence and derived amino acid sequence of neutrophil 
activating peptide-2. 

The foregoing amino acid sequences correspond to the products of a single gene called 

25 platelet basic protein (Walz and Baggiolini, BBRC 159: 969-981, 1989; Castor, et al, BBRC 
163: 1071-1078, 1989). The complete gene sequence of platelet basic protein is well known. 
See, for example, Wenger et al.. Blood, 73: 1498-1503. 1989 and Proc. Natl Acad. ScL USA. 
90, 3660-3664. 1993. 

The present invention also provides heparanase having the amino acid sequences of 

30 other members of the CXC chemokine family [including Platelet factor 4 (SEQ. ID NO. 12), 
TlP-10 (SEQ. ID NO. 14), gro/MGSA (SEQ. ID NO. 16), gro-&MB>-2a (SEQ. ID NO. 18), 
gr //MIP-2p (SEQ. ID NO. 20), Interleukin-8/NAP-l (SEQ. ID NO. 22) and ENA-78 (SEQ. ID 
NO. 24)] as well as member* of the CC chemokine family [including MHMct (SEQ. ID NO. 
26), MIP-lp (SEQ. ID NO. 28), 1-309 (SEQ. ID NO. 23), MCP-1 (SEQ. ID NO. 32), MCP-3 

35 (SEQ. ID NO. 34), RANTES (SEQ. ID NO. 36), fic (SEQ. ID NO. 38) and MCP-2 (SEQ. ID 
NO. 40)]; purified to apparent homogeneity, prepared in the presence of reducing conditions, 
and activated by treatment with transglutaminase. Suitable transglutaminases that may be used 
for this purpose include Activated Factor Xffla, guinea pig liver transglutaminase, epidermal 
transglutaminase, keratinocyte transglutaminase, and tissue transglutaminase. 

40 In another aspect, the present invention provides a heparanase having the amino acid 
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sequence (SEQ ID NO: 12) of: 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Oly Leu Leu Phe Leu Gly Leu Leu 
Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gin 
Cys Leu Cys Val Lys Thr Thr Ser Gin Val Arg Pro Arg His lie Thr Ser Leu Glu Val 
5 He Lys Ala Gly Pro His Cys Pro Hit Ala Gin Leu lie Ala Thr Leu Lys Asn Gly Arg 

Lys lie Cys Leu Asp Leu Gin Ala Pro Leu Tyr Lys Lys lie He Lys Lys Leu Leu Glu 
Ser 

encoded by the cDNA sequence (SEQ ID NO: 13) of: 

1 CCGCAGCATG AGCTCCGCAG CCGGGTTCTG CGCCTCACGC CCCGGGCTGC 
10 51 TGTTCCTGGG GTTGCTGCTC CTGCCACTTG TGGTCGCCTT CGCCAGCGCT 

101 GAAGCTGAAG AAGATGGGGA CCTGCAGTGC CTGTGTGTGA AGACCACCTC 
151 CCAGGTCCGT CCCAGGCACA TCACCAGCCT GGAGGTGATC AAGGCCGGAC 
201 CCCACTCCCC CACTGCCCAA CTGATAGCCA CGCTGAAGAA TGGAAGGAAA 
251 ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 
15 301 TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC 

351 ATACTTCTTT TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT 
401 TTGTGTTATC CTTATGATTT TAAATAAACA AAATAAATC 
which corresponds to the cDNA sequence and derived amino acid sequence of platelet factor 4. 
See Poncz et al.. Blood 69, 219-223 (1987). 
20 In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 14) of: 

Met Asn Gm Thr Ala He Leu lie Cys Cys Leu He Phe Leu Thr Leu Ser Gly lie Gin Gly 
Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys lie Ser He Ser Asn Gin Pro Val Asn Pro 
Val Asn Pro Arg Ser Leu Glu Lys Leu Glu lie lie Pro Ala Ser Gin Phe Cys Pro Arg 
25 Val Glu He lie Ala Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro Glu Ser 

Lys Ala He Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met Ser Lys Arg Ser Pro 
encoded by the cDNA sequence (SEQ ID NO: 15) of: 

1 GAGACATTCC TGAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC 
51 CTCCAGTCTC AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA 
30 101 TCTTTCTGAC TCTAAGTGGC ATTCAAGGAG TACCTCTCTC TAGAACCGTA 

151 CGCTGTACCT GCATCAGCAT TAGTAATCAA CCTGTTAATC CAAGGTCTTT 
201 AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA CGTGTTGAGA 
251 TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 
301 TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA 
35 351 AAGATCTCCT TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG 
401 GATGGACCAC ACAGAGGCTG CCTCTCCCAT CACTTCCCTA CATGGAGTAT 
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451 ATGTCAAGCC ATAATTGTTC TTAGTTTGCA GTTACAOTAA AAGGTGACCA 
501 ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA GGTTAATGTT 
551 CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 
601 GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA 
5 which corresponds to the cDNA sequence and derived amino acid sequence 7IP-IO. See Luster 
et al., Nature 315. 672-676 (1985). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 16) of: 

Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu 
10 Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala Ala Gly Ala Ser Val Ala Thr Glu 

Leu Arg Cys Gin Cys Leu Gin Thr Leu Gin Gly He His Pro Lys Asn lie Gin Ser Val 
Asn Val Lys Ser Pro Gly Pro His Cys Ala Gin Thr Glu V-*. He Ala Thr Leu Lys Asn 
Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro lie Val Lys Lys He He Glu Lys Met 
Leu Asn Ser Asp Lys Ser Asn 
15 encoded by the cDNA sequence (SEQ ID NO: 17) of: 

1 CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC 
51 CCCATGGCCC GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT 
101 GCGAGTGGCA CTGCTGCTCC TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG 
151 CAGGAGCGTC CGTGGCCACT GAACTGCGCT GCCAGTGCTT GCAGACCCTG 
20 201 CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAACGTGA AGTCCCCCGG 
251 ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 
301 AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG 
351 ATGCTGAACA GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC 
401 TGGTGGCTGT TCCTGAAGGA GGCCCTGCCC TTATAGGAAC AGAAGAGGAA 
25 451 AGAGAGACAC AGCTGCAGAG GCCACCTGGA TTGTGCCTAA TGTGTTTGAG 
501 CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT TAGTTTTGAA 
551 GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 
601 CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC 
651 AAGTTAGTTC AATCTGGATT CATATTTAAT TTGAAGGTAG AATGTTTTC A 
30 701 AATGTTCTCC AGTCATTATG TTAATATTTC TGAGGAGCCT GCAACATGCC 
751 AGCCACTGTG ATAGAGGCTG GCGGATCCAA GCAAATGGCC AATGAGATCA 
801 TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA ACTGTTTAGA 
851 TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 
901 TTTCTCATGT TGAAACTTTA AGAACTAAAA TGTTCTAAAT ATCCCTTGGA 
35 951 CATTTTATGT CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT 
1001 ACAGTGTTTC TGGCTTAGAA CAAAGGGGCT TAATTATTGA TGTTTTCGGA 
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which corresponds to the cDNA sequence and derived amino acid sequence of gro/MGSA 
(melanoma growth stimulatory activity). See Anisowicz et al. ( Proc. Nad. Acad. ScL U.S.A. 
84, 7188-7192 (1987). 

In another aspect, the present invention provides a heparanase having the amino acid 
5 sequence (SEQ ED NO: 18) of: 

Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu 
Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Pro Lys Ala Thr Glu 
Lys Arg Cys Gin Cys Lys Gin Thr Leu Gin Gly He His Leu Lys Asn De Gin Ser Val 
Lys Val Lys Ser Pro Gly Pro His Cys Ala Gin Thr Glu Val lie Ala Thr Leu Lys Asn 
10 Gly Gin Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys lie He Glu Lys Met 

Leu Lys 

encoded by the cDNA sequence (SEQ ID NO: 19) of: 

1 CTCTCCTCCT CGCACAGCCG CTCGAACCGC CTGCTGAGCC CCATGGCCCG 
51 CGCCACGCTC TCCGCCGCCC CCAGCAATCC CCGGCTCCTG CGGGTGGCGC 

15 101 TGCTGCTCCT GCTCCTGGTG GCCGCCAGCC GGCGCGCAGC AGGAGCGCCC 

151 CTGGCCACTG AACTGCGCTG CCAGTGCTTG CAGACCCTGC AGGGAATTCA 
201 CCTCAAGAAC ATCCAAAGTG TGAAGGTGAA GTCCCCCGGA CCCCACTGCG 
251 CCCAAACCGA AGTCATAGCC ACACTCAAGA ATGGGCAGAA AGCTTGTCTC 
301 AACCCCGCAT CGCCCATGGT TAAGAAAATC ATCGAAAAGA TGCTGAAAAA 

20 351 TGGCAAATCC AACTGACCAG AAGGAAGGAG GAAGCTTATT GGTGGCTGTT 
401 CCTGAAGGAG GCCCTGCCCT TACAGGAACA GAAGAGGAAA GAGAGACACA 
451 GCTGCAGAGG CCACCTGGAT TGCGCCTAAT GTGTTTGAGC ATCACTTAGG 
501 AGAAGTCTTC TATTTATTTA TTTATTTATT TATTTGTTTG TITTAGAAGA 
551 TTCTATGTTA ATATTTTATG TGTAAAATAA GGTTATGATT GAATCTACTT 

25 601 GCACACTCTC CCATTATATT TATTGTTTAT TTTAGGTCAA ACCCAAGTTA 

651 GTTCAATCCT GATTCATATT TAATTTGAAG ATAGAAGGTT TGCAGATATT 
701 CTCTAGTCAT TTGTTAATAT TTCTTCGTGA TGACATATCA CATGTCAGCC 
751 ACTGTGATAG AGGCTGAGGA ATCCAAGAAA ATGGCCAGTG AGATCAATGT 
801 GACGGCAGGG AAATGTATGT GTGTCTATTT TGTAACTGTA AAGATGAATG 

30 851 TCAGTTGTTA TTTATTGAAA TGATTTCACA GTGTGTGGTC AACATTTCTC 

901 ATGTTGAAGC TTTAAGAACT AAAATGTTCT AAATATCCCT TGGACATTTT 
951 ATGTCTTTCT TGTAAGGCAT ACTGCCTTGT TTAATGTTAA TTATGCAGTG 
1001 TTTCCCTCTG TGTTAGAGCA GAGAGGTTTC GATATTTATT GATGTTTTCA 
1051 CAAAGAACAG GAAAATAAAA TATTTAAAAA T 

35 which corresponds to the cDNA sequence and derived amino acid sequence gro-p/MIP-2a 
(macrophage inflammatory protein 2-a). See Tekamp-Olson et al., J. Exp. Med 172, 911-919 
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(1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 20) of: 

Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu 
5 Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Ser Val Val Thr Glu 

Leu Arg Cys Gin Cys Leu Gin Thr Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val 
Asn Val Arg Ser Pro Gly Pro His Cys Ala Gin Thr Glu Val De Ala Thr Leu Lys Asn 
Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gin Lys lie lie Glu Lys lie Leu 
Asn Lys Gly Ser Thr Asn 
10 encoded by the cDNA sequence (SEQ ID NO: 21) of: 

1 CTCGCACAGC TTCCCGACGC GTCTGCTGAG CCCCATGGCC CACGCCACGC 
51 TCTCCGCCGC CCCCAGCAAT CCCCGGCTCC TGCGGGTGGC GCTGCTGCTC 
101 CTGCTCCTGG TGGCCGCCAG CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC 
151 TGAACTGCGC TGCCAGTGCT TGCAGACACT GCAGGGAA1T CACCTCAAGA 
15 201 ACATCCAAAG TGTGAATGTA AGGTCCCOCG GACCCCACTG CGCCCAAACC 
251 GAAGTCATAG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC 
301 ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA 
351 CCAACTGACA GGAGAGAAGT AAGAAGCTTA TCAGCGTATC ATTGACACTT 
401 CCTGCAGGGT GGTCCCTGCC CTTACCAGAG CTGAAAATGA AAAAGAGAAC 
20 451 AGCAGCTTTC TAGGGACAGC TGGAAAGGAC TTAATGTGTT TGACTATTTC 
501 TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG CTTGTATTTT 
551 AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTT CATCAAACAT 
601 AGCTCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTT AAATGTGTCA 
651 TTAAACTAAT ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA 
25 701 TGACAGGGTG GGGAACTGGA GGGTGGGGGG ATTGAAATGC AAGCAATTAG 
751 TGGATCACTG TTAGGGTAAG GGAATGTATG TACACATCTA TTTTTTATAC 
801 Tim 1 1 1 1 A AAAAAAGAAT GTCAOTTGTT ATTTATTCAA ATTATCTCAC 
851 ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATTTTATGT 
901 crTGCTTGTA GGGCATAATG CCTTGTTTAA TGTCCATTCT GCAGCGTITC 
30 951 TCTTTCCCTT GGAAAAGAGA ATTTATCATT ACTGTTAC 

which corresponds to the cDNA sequence and derived amino acid sequence gro-yfMJP-2$ 
(macrophage inflammatory protein 2-p). See Tekamp-Olson et al., J. Exp. Med. 172, 911-919 
(1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
35 sequence (SEQ ID NO: 22) of: 

Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu lie Ser Ala Ala Leu Cys 
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Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu Arg Cys Gin Cys lie Lys Thr Tyr 
Ser Lys Pro Phe His Pro Lys Phe lie Lys Glu Leu Arg Val lie Glu Ser Gly Pro His Cys 
Ala Asn Thr Glu He He Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro Lys 
Glu Asn Trp Val Gin Arg Val Val Glu Lys Phe Leu Lys Arg Ala Glu 
5 encoded by the cDNA sequence (SEQ ID NO: 23) of: 

1 ATGACTTCCA AGCTGGCCGT GGCTCTCTTG GCAGCCTTCC TGATTTCTGC 
51 AGCTCTGTGT G AAG GTGCAG TTTTGCCAAG GAGTGCTAAA GAACTTAGAT 
101 GTCAGTGCAT AAAGACATAC TCCAAACCTT TCCACCCCAA ATTTATCAAA 
151 GAACTGAGAG TGATTGAGAG TGGACCACAC TGCGCCAACA CAGAAATTAT 
10 201 TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC AAGGAAAACT 
251 GGGTGCAGAG GGTTGTGGAG AAGTTTTTGA AGAGGGCTGA G 
which corresponds to the cDNA sequence and derived amino acid sequence Interleukin-8/NAP-l 
(neutrophil activating protein-1). See Kunser et al., Kidney Int. 39. 1240-1248 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 
15 sequence (SEQ ID NO: 24) of: 

Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu Gin Thr Thr Gin 
Gly Val His Pro Lys Met lie Ser Asn Leu Gin Val Phe Ala He Gly Pro Gin Cys Ser 
Lys Val Glu Val Val Ala Ser Leu Lys Asn Gly Lys Glu He Cys Leu Asp Pro Glu Ala 
Pro Phe Leu Lys Lys Val He Gin Lys ne Leu Asp Gly Gly Asn Lys Glu Asn 
20 encoded by the cDNA sequence (SEQ ID NO: 25) of: 

1 GTGTTGCGGG AACTGCGGTG CGTGTGTTTA CAGACCACGC AGGGAGTTCA 
51 TCCCAAAATG ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT 
101 CCAAGGTGGA AGTGGTAGCC TCCCTGAAGA ACGGGAAGGA AATTTGTCTT 
151 GATCCAGAAG CCCCTTTTCT AAAGAAAGTC ATCCAGAAAA TCCTCGACGG 
25 201 CGGCAACAAA GAAAAC 

which corresponds to the cDNA sequence and derived amino acid sequence of a novel 
inflammatory peptide (ENA-78) with homology to interleukin 8. See Walr et al.. I. Exp. Med. 

174. 1355-1362 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 

30 sequence (SEQ ID NO: 26) of: 

Met Gin Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Thr Met Ala Leu Cys Asn Gin 
Val Leu Ser Ala Pro Leu Ala Ala Asp Thr Pro Thr Ala Cys Cys Phe Ser Tyr Thr Ser 
Arg Gin He Pro Gin Asn Phe He Ala Asp Tyr Phe Glu Thr Ser Ser Gin Cys Ser Lys 
Pro Ser Val He Phe Leu Thr Lys Arg Gly Arg Gin Val Cys Ala Asp Pro Ser Glu Glu 

35 Trp Val Gin Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala 

encoded by the cDNA sequence (SEQ ID NO: 27) of: 
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1 GAATTCAAGG CCTGTCCTGG TTTGGTCCCA ATTTACCTTT ATCATCCATA 
51 TTCACCCCCA CTGCTCTGCA GCTCCACTGA AGCACCCCCT CTTTCCTCTG 
101 AGCCACAATG TCACACCCAG GACTCTGCCT CAGCTGGGCC TCCACTGCCC 
151 ACCCATCTAT AGATGCCTAA ATCCCGGGCA GTTATCCAGA CACAACTAAA 

5 201 GTTCCATCCC TTCCATGAAG CCTTCCCCAA CCCTCTGGTG GAAGGTCACT 
251 TCTTCCTCAT GGGGTTCTGA GCTTTCATTT CTTTTTCTAC TAAGAGTTTT 
301 ACAATTACCT GTTCATACAC TCTACCTGCC CCCATGAGAC CAGGGGCATC 
351 TCAGAAACAA AGATCATTAA AACCAACTAA ATCTATTTCT CATTATAAAA 
401 TGAGATATGC TGATTGATTG CAAAATAATA AAATAACAAA GTATGGAAAA 

10 451 GAAAAAAAAA AGCATATAAT CTGGCTGAGA AGGTAGAGAC CCTTCCACAC 
501 CACTGAAATT ATGTGTTGAA AAGAATAAGG AAAAAACTGC TTCAGTTTGG 
55 1 CATTATTTAT GTAAGTATAG TATAGGATCC TTAAAATGGT TCAAAGAAAT 
601 GGGAAATCAA GACTTCATTT TGGCAAAGCC ATTGAACAGA AACTGTAGCA 
651 TATTTATCAG TAATTTCTTT CAGATTAAAC AACTGACAAC AACCCACTTT 

15 701 TCAACCAGTG ATGTTGGAAA TGTTTTAAAA CAAAATTAGT TCATAAATTT 
751 GTGGGTTGAC CAAGAAGGTA ATAAAGTCTC ACTAAATAAA ATGAGGAAAA 
801 TTCAGAAAAA GAAAAAAATA AOAAAATAAA TCACCCATGG ATCTAAGCAC 
851 TATTCATTCT TTAAGGCATG TATTTCCAAG CCTTTTAATT TTTTCATGCC 
901 TAGAGTTGGC ATGGCATATA TATATCTTTA TACAATTCTT CAAATTTTAT 

20 951 AGAATTTGTA TAATGTTTTA TCTTGCTTTT TTTTTAACCA CTGATGTTAT 

1001 AAGCATATTT ATGCCACTTC ATTCACGTTA GAGACTTAAT AATAAAGGAT 
1051 CTTGTGGATA ATTTATCATT CCCTGATAGA GAAAAATTTA GCTTTGCTTA 
1 101 TTTTAGAGTT ATAAATGATG CTGGGTCAGG TATCTTTATG TTTGAAGATG 
1151 GCTCCATATT TGGGTTGTTT CCACAGAACT CTTTCCAGAA ATGCTTTTTC 

25 1201 TAGGTTAATG GCTACACATA TTTCTAGGCA CCTGACATAC TGACACCCAC 
1251 CTCTAAAGTA TTTTTATGAT CCACAACTAG CGTTTAACAC AGCGCCCCAG 
1301 TCACTCCGAG ACTAATAAAT AGACAAATGA CTGAAACGTG ACCTCATGCT 
1351 TTCTATTCCT CCAGCTTTCA TTGAGTTCCT TTCCTCTGGG AGGACTGGGG 
1401 GTTGTCTAGC CCTCCACAGC ATCAGCCCAT TGACCCTATC CTTGTGGTTA 

30 1451 TAGCAGCTGA GGAAGCAGAA TT ' AGCTCT GTGGGAAGGA ATGGGGCTGG 
1501 AGAGTTCATG CATAGACCAA TTClTTTTTT m i l l 1 ill TGAGATGGAG 
1551 TTTCACTTTT GTTGCCCAGG CTGGAGTGCA ATGGCATGAT CTCAGCTCAC 
1601 CACAGCCCCC ACCTCCTGGG TTCAAGCGAT TCTCCTGCCC TCAGCCTCCC 
1651 GAGTAGCTGG GATTACAGGC ATGTGCCACC ACGCCTGACT ACTnTGTAT 

35 1701 TTTTAGTAGA GATGGAGTTT CTCTTTCTTG GTCAGGTTGG TCTCAAACTC 
1751 CTGACCTCAG GTGATCTGCA GCCTCGGCCT CCAAAGTGTT GGGATTACAG 
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1801 GTGTGAGCGA CCATGCCTGG CTGCATAGAC CAGTTCTTAT GAGAAGGGAT 
1851 CAACTAAGAA TAGCCTTGGG TTGACACACA CCCCTCTTCA CACTCACAGG 
1901 AGAAACCCCA TGAAGCTAGA ACCAGTCATG AGTTGAGAGC TGAGAGTTAG 
1951 AGAGTAGCTC AGAGATGCTA TTCTTGGATA TCCTGAGCCC CTGTGGTCAC 

5 2001 CAGGGACCCT GAGTTGTGCA ACACTCAGCA TGACAGCATC ACTACACTTA 
2051 AAAATTTCCC TCCTCACCCC CAGATTCCAT TTCCCCATCC GCCAGGGCTG 
2101 CCTATAAAGA GGAGAGATGG CTTCAGACAT CAGAAGGACG CAGGCAGCAA 
2151 AGAGTAGTCA GTCCCTTCTT GGCTCTGCTG ACACTCGAGC CCACATTCCA 
2201 TCACCTGCTC CCAATCATGC AGGTCTCCAC TGCTGCCCTT GCCGTCCTCC 

10 2251 TCTGCACCAT GGCTCTCTGC AACCAGGTCC TCTCTGCACC ACGTGAGTCC 
2301 ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT AGACCACATC 
2351 AGTCTTTTTT TGTGGCGTGA GAGGCCCCGA AGAGAAAAGA AGGAAGTTCT 
2401 TAAAGCGCTG CCAAACACCT TGGTCTTTTT CTTCACAACT TTTATTTTTA 
2451 TCTCTAGAAG GGGTCTTAGC CCTCCTAGTC TCCAGGTATG AGAATCTAGG 

15 2501 CAGGGGCAGG GGAGTTACAG TCCCTTGTAC AGATAGAAAA ACAGGGTTCA 
2551 AAACGAATCA GTTTGCAAGA GGCAGAATCC AGGGCTGCTT ACTTCCCAGT 
2601 GGGGTCTGTT CTTCACTCTC CAGCTCACCC TAGTCTCCCA GGAGCCCTGT 
2651 CCCTTGGATG TCTTATGAGA GATGTCCAGG GCTTCTCTTG GGCTGGGGTA 
2701 TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTGC 

20 2751 ATTCAGGTAT CTGGATCACA TAGAGAAACA GAGAACCCAC TATGAAGAGT 
2801 CAAGGGGAAA GAGGAATATA GACAGAAACA AAGAGACATT TCTCTGCAAA 
2851 ACCCCCCAAA TGCCTTGCAG TCACTTGGTC TGAGCAAGCC TGCCCTCCTC 
2901 AACCACTCAG GGATCAGAAG CTGCCTGGCC TTTTCTTCTG AGCTGTGACT 
2951 TGGGCTTATT CTCTCCTTTC TCCGCAGTTG CTGCTGACAC GCCOACCGCC 

25 3001 TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA 
3051 CTACTTTGAG ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC 
3101 CAGTCTTCCT GCTCACCTCT AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA 
3151 GAAACAGGCC AGAAGGCCAT CCTGGAAAGG CCCAGCCTTC AGGAGCCTAT 
3201 CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT TGGGGCTGGA 

30 3251 GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCCCAGG CCAGAGGAAA 
3301 GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCTGG 
3351 AGTCACTGAG AGAAGCTCTC TAGACGGAGA TAGGCAGGGG GCCCCTGAGA 
3401 GAGGAGCAGG CCTTGAGCTG CCCAGGACAG AGAGCAGGAT GTCAGGGCCA 
3451 TGGTGGGCCC AGGATTCCCC GGCTGGATTC (XCAGTGCTT AACTCTTCCT 
35 3501 CCCTTCTCCA CAGCTTCCTA ACCAAGAGAG GCCGGCAGGT CTGTGCTGAC 
3551 CCCAGTGAGG AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC 
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3601 CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG 
3651 GGAGGAGCAG GAGCCTGAGC CTTGGGAACA TGOGTGTGAC CTCCACAGCT 
3701 ACCTCTTCTA TGGACTGGTT ATTGCCAAAC AGCCACACTG TGGGACTCTT 
3751 CTTAACITAA ATTTTAATTT ATTTATACTA TTTAGTTTTT ATAATTTATT 
5 3801 TTTGATTTCA CAGTGTGTTT GTGATTGTTT GCTCTG GAG TTCCCCCTGT 
3851 CCCCTCCACC TTCCCTCACA GTGTGTCTGG TGACAA(XGA GTGGCTGTCA 
3901 TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT 
3951 GTATCAGATG CTTTTGTTCA GGGCTGTGAT CGGCCTGGG*" AAATAATAAA 
4001 QATGTTCTTT TAAACGGTAA ACCAGTATTG AGTTTGGTTT TGTTTTTCTG 
10 4051 GCAAATCAAA /VTCACTGGTT AAGAGGAATC ATAGGCAAAG ATTAGGAAGA 
4101 GGTGAAATGG AGGGAAATTG GGAGAGATGG GGAGCGCTGC GACAGAGTTA 
4151 TCCACTTCAC AAAATTCTGG AACATTGAAA CTAOGAATAT GOATAACTCT 
4201 AAATCGTAAT ATGCACGCTC TAGGAGAATT AACTACTTGA ATGGCCACC. 
4251 TTAAGCAGAG TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT 
15 4301 AGCAATTATT TACATTGTTG AGGCTTACTC CTCCTACTGA GTGCTTTTTA 
4351 TACATTGTTC ATTTAATCTT ACCAATGCAA TAGTACAGCT TAGGTACTAT 
4401 TAATACCTCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA AAGGTTAGAC 
4451 AACTTGGCTG AGGTTACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 
4501 TGGACTTTTA TTGAACTACA GACTATGCTG TTAACCATTG ACCAAGTTAT 
20 4551 TTCCCAAAGT ATGACCCGCC TATACTCAAA TCTTACCCCA TTCTTTAACA 
4601 GATGATACTT TATCCATTGC AACCACTTCC TGTCAGGATT CTGAGTTGAC 
4651 ATAGAGTGTT TCAGCAGTGA TTATTTAAGC CAATTACATC AGGATCTTTA 
4701 GGTGTAGACC TGGGAACTGA TATTTTTATC AAGCTCATGA GGTGTTCCAT 
4751 AGCATGTTAA TGACTGAGAG OCACTGTCAA TAGAATTC 
25 which corresponds to the cDNA sequence and derived amino acid sequence MIP-la 

(macrophage inflammatory protein 1-a). See Blum et al., DNA Cell Biol. 9. 589-602 (1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 28) of: 

Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu Val Ala Ala Phe Cys Ser Pro 
30 Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr Ala Cys Cys Phe Ser Tyr Thr Ala 

Arg Lys Leu Pro Arg Asn Phe Val Val Asp Tyr Tyr Glu Thr Ser Ser Leu Cys Ser Gin 
Pro Ala Val Val Phe Gin Thr Lys Arg Ser Lys Gin Val Cys Ala Asp Pro Ser Glu Ser 
Trp Val Gin Glu Tyr Val Tyr Asp Leu Glu Leu Asn 
encoded by the cDNA sequence (SEQ ID NO: 29) of: 
35 1 TTCCCCCCCC CCCCCCCCCC CCCOGCCCGA GCACAGGACA CAGCTGGGTT 

51 CTGAAGCTTC TGAGTTCTGC AGCCTCACCT CTGAGAAAAC CTCTTTTCCA 
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101 CCAATACCAT G A AG CTCTGC GTGACTGTCC TGTCTCTCCT CATGCTAGTA 
151 GCTGCCTTCT GCTCTCCAGC GCTCTCAGCA CCAATGGGCT CAGACCCTCC 
201 CACCGCCTGC TGCTTTTCTT ACACCGCGAG GAAGCTTCCT CGCAACTTTG 
251 TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA 

5 301 TTCCAAACCA AAAGAAGCAA GCAAGTCTGT GCTGATCCCA GTGAATCCTG 
351 GGTCCAGGAG TACGTGTATG ACCTGGAACT GAACTGAGCT GCTCAGAGAC 
401 AGGAAGTCTT CAGGGAAGGT CACCTGAGCC CGGATGCTTC TCCATGAGAC 
451 ACATCTCCTC CATACTCAGG ACTCCTCTCC GCAGTTCCTG TCCCTTCTCT 
501 TAATTTAATC TTTTTTATGT GCCGTGTTAT TGTATTAGGT GTCATTTCCA 

10 551 TTATTTATAT TAGTTTAGCC AAAGGATAAG TGTCCTATGG GGATGGTCCA 
601 CTGTC ACTGT TTCTCTGCTG TTGCAAATAC ATGGATAACA CATTTGATTC 
651 TGTGTGTTTT CCATAATAAA ACTTTAAAAT AAAATGCAGA CAGTTA 
which corresponds to the cDNA sequence and derived amino acid sequence MIP-1P 
(macrophage inflammatory protein 1-0). See Lipes et al„ Proc. Natl. Acad. Sci. U.S.A. 85, 

15 9704-9708 (1988). 

In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 30) of: 

Met Gin lie lie Thr Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met Trp Pro GIu Asp 
Val Asp Ser Lys Ser Met Gin Val Pro Phe Ser Arg Cys Cys Phe Ser Phe Ala Glu Gin 
20 Glu lie Pro Leu Arg Ala lie Leu Cys Tyr Arg Asn Thr Ser Ser He Cys Ser Asn Glu 

Gly Leu De Phe Lys Leu Lys Arg Gly Lys Glu Ala Cys Ala Leu Asp Thr Val Gly Trp 
Val Gin Arg His Arg Lys Met Leu Arg His Cys Pro Sex Lys Arg Lys 
encoded by the cDNA sequence (SEQ ID NO: 31) of: 

1 ACCAGGCTCA TCAAAGCTGC TCCAGGAAGG CCCAAGCCAG ACCAGAAGAC 
25 51 ATGCAGATCA TCACCACAGC CCTGGTGTGC TTGCTGCTAG CTGGGATGTG 

101 GCCGGAAGAT GTGGACAGCA AGAGCATGCA GGTACCCTTC TCCAGATGTT 
151 GCTTCTCATT TGCGGAGCAA GAGATTCCCC TGAGGGCAAT CCTGTGTTAC 
201 AGAAATACCA GCTCCATCTG CTCCAATGAG GGCTTAATAT TCAAGCTGAA 
251 GAGAGGCAAA GAGGCCTGCG CCITGGACAC AGTTGGATGG GTTCAGAGGC 
30 301 ACAGAAAAAT GCTGAGGCAC TGCCCGTCAA AAAGAAAATG AGCAGATTTC 
351 TTTCCATTGT GGGCTCTGGA AACCACATGG CTTCACCTGT CCCCGAAACT 
401 ACCAGCCCTA CACCATTCCT TCTGCCCTGC TTTTGCTAGG TCACAGAGGA 
451 TCTGCTTGGT CTTGATAAGC TATGTTGTTG CACTTTAAAC ATTTAAATTA 
501 TACAATCATC AACCCCCAAC 
35 which corresponds to the cDNA sequence and derived amino acid sequence human secreted 
protein (1-309). See Miller et aL. J. Immunol. 143, 2907-2916 (1989). 
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In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 32) of: 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu lie Ala Ala Thr Phe lie Pro Gin 
Gly Lys Ala Gin Pro Asp Ala lie Asn Ala Pro Val Thr Cys Cys Tyr Asn Phe Thr Asn 
5 Arg Lys De Ser Val Gin Arg Leu Ala Ser Tyr Arg Arg lie Thr It- Ser Lys Cys Pro Lys 

Glu Ala Val Be Phe Lys Thr De Val Ala Lys Glu De C Ala A , LTo Lys Gin Lys Tip 
Val Gin Asp Ser Met Asp His Leu Asp Lys Gin Thr Gin Thr Pro Lys Thr 
encoded by the cDNA sequence (SEQ ID NO: 33) of: 

1 CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC 
10 51 AGCATGAAAG TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC 

101 CTTCATTCCC CAAGGGCTCG CTCAGCCAGA TGCAATCAAT GCCCCAGTCA 
151 CCTGCTGTTA TAACTTCACC AATAGGAAGA TCTCAGTGCA GAGGCTCGCG 
201 AGCTATAGAA GAATCACCAG CAGCAAGTGT CCCAAAGAAG CTGTGATCTT 
251 CAAGACCATT GTGGCCAAGG AGATCTGTGC TGACCCCAAG CAGAAGTGGG 
15 301 TTCAGGATTC CATGGACCAC CTGGACAAGC AAACCCAAAC TCCGAAGACT 

351 TGAACACTCA CTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT 
401 AGCTITCCCC AGACACCCTG TTTTATTTTA TTATAATGAA TTTTGTTTGT 
451 TGATGTGAAA CATTATGCCT TAAGTAATGT TAATTCTTAT TTAAGTTATT 
501 GATGTTTTAA GTITATCTTT CATGGTACTA GTGTTTTTTA GATACAGAGA 
20 551 CTTGGGGAAA TTGCTTTTCC TCTTGAACCA CAGTTCTACC CCTGGGATGT 
601 TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT 
651 CCAATGCATT GCTAAAATAT TATTGTGGAA ATGAATATTT TGTAACTATT 
701 ACACCAAATA AATATATTTT TGTAC 
which corresponds to the cDNA sequence and derived amino acid sequence monocyte 
25 chemoattractant protein 1 (MCP-1). See Yoshimura et al., FEBS Lett. 244, 487-493 (1989). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 34) of: 

Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala Phe Ser Pro Gin 
Gly Leu Ala Gin Pro Val Gly He Asn Thr Ser Thr Thr Cys Cys Tyr Arg Phe lie Asn 
30 Lys Lys He Pro Lys Gin Arg Leu Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro 

Are Qlu Alfl val lie Phe ky? Tte ty* A?p l-y? Qi» De (?y? Ah* a?p Pre Tiff Gin 

Lys Tip Val Gin Asp Phe Met Lys His Leu Asp Lys Lys Thr Gin Thr Pro Lys Leu 
encoded by the cDNA sequence (SEQ ID NO: 35) of: 

1 AGCAGAGGGG CTGAGACCAA ACCAGAAACC TCCAATTCTC ATGTGGAAGC 
35 51 CCATGCCCTC ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG 

101 CTGCTCACAG CAGCTGCTTT CAGCCCCCAG GGGCTTGCTC AGCCAGTTGG 
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151 GATTAATACT TCAACTACCT GCTGCTACAG ATTTATCAAT AAGAAAATCC 
201 CTAAGCAGAG GCTGGAGAGC TACAGAAGGA CCACCAGTAG CCACTGTCCC 
251 CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 
301 CCCCACACAG AAGTGGGTCC AGGACTCTAT GAAGCACCTG GACAAGAAAA 

5 351 CCCAAACTCC AAAGCTTTGA ACATTCATGA CTGAACTAAA AACAAGCCAT 
401 GACTTGAGAA ACAAATAATT TGTATACCCT GTCCTTTCTC AGAGTGGTTC 
451 TGAGATTATT TTAATCTAAT TCTAAGGAAT ATGAGCTTTA TGTAATAATG 
501 TGAATCATGG TTTTTCI 1 A O TAGATTTTAA AAGTTATTAA TATTTTAATT 
551 TAATCTTCCA TGGATTTTGG TGGGTTTTGA ACATAAAOCC TTOGATGTAT 

10 601 ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTCCTCC CTTCTCTACC 
651 TCATGGGGGT ATTGTATAAG TCCTTGCAAG AATCAGTGCA AAGATTTGCT 
701 TTAATTGTTA AGATATGATG TCCCTATGGA AGCATATTGT TATTATATAA 
751 TTACATATTT GCATATGTAT GACTCCCAAA TTTTCACATA AAATAGATTT 
801 TTGTAAAAAA 

15 which corresponds to the cDNA sequence and derived amino acid sequence monocyte 

chemoattractant protein 3 (MCP-3). See: Structural and Functional Identification of Two . 
Human, Tumor-derived Monocyte Chemotactic Proteins (MCP-2 and MCP-3) Belonging to the 
Chemokine Family. Jo Van Damme, Paul Proost, Jean-Pierre Lenaerts, and Ghislain 
Opdenakker. J. Exp. Med. 176: 59-65, 1992. 
20 In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 36) of: 

Met Lys Val Ser Ala Ala Arg Leu Ala Val De Leu He Ala Thr Ala Leu Cys Ala Pro 
Ala Ser Ala Sex Pro Tyr Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr lie Ala Arg Pro 
Leu Pro Arg Ala His De Lys Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala 
25 Val Val Phe Val Thr Arg Lys Asn Arg Gin Val Cys Ala Asn Pro Glu Lys Lys Trp Val 

Arg Glu Tyr lie Asn Ser Leu Glu Met Ser 
encoded by the cDNA sequence (SEQ ID NO: 37) of: 

1 CCTCCGACAG CCTCTCCACA GGTACCATGA AGGTCTCCGC GGCACGCCTC 
51 GCTGTCATCC TCATTGCTAC TGCCCTCTGC GCTCCTGCAT CTGCCTCCOC 
30 101 ATATTCCTCG GACACCACAC CCTGCTGCTT TGCCTACATT GCCCGCCCAC 

151 TGCCCCGTGC CCACATCAAG GAGTATTTCT ACACCAGTGG CAAGTGCTCC 
201 AACCCAGCAG TCGTCTTTGT CACCCGAAAG AACCGCCAAG TGTGTGCCAA 
251 CCCAGAGAAG AAATGGGTTC GGGAGTACAT CAACTCTTTG GAGATGAGCT 
301 AGGATGGAGA GTCCTTGAAC CTGAACTTAC ACAAATTTGC CTGTTTCTGC 
35 351 TTGCTCTTGT CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC 

401 GCTCCTTGAA GGGCCCAGAT TCTGACCACG ACGAGCAGCA GTTACAAAAA 
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451 CCTTCCCCAG GCTGGACGTG GTGGCTCAGC CTTGTAATCC CAGCACTTTG 
501 GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG AGACAGCCTG 
551 GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 
601 GCGTGGTAGC GGGCG CCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 
5 651 AGAATGGCGT GAACCCGGGA GCGGAGCTTG CAGTGAGCCG AGATCGCGCC 
701 ACTGCACTCC AGCCTGGGCG ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA 
751 AAAAAAAAAA AAAAAATACA AAAATTAGCC GCGTGGTGGC CCACGCCTGT 
801 AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA 
851 GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 
10 901 GACAAAGTGA GACTCCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT 
951 AAAGCCTAGA AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA 
1001 GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCAAGGGCTC TGTGACAAGG 
1051 AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AAGGGAGGAA CACTGCACTC 
1101 TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA 
15 1151 AAAATCGGGG 

which corresponds to the cDNA sequence and derived amino acid sequence Human T 
cell-specific protein (RANTES). See Schall et al.. J. Immunol. 141, 1018-1025 (1988). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 38) of: 
20 Met Arg He Ser Ala Thr Leu Leu Cys Leu Leu Leu He Ala Ala Ala Phe Ser De Gin Val 

Trp Ala Gin Pro Asp Gly Pro Asn Ala Ser Thr Cys Cys Tyr Val Lys Lys Gin Lys De 
Pro Lys Arg Asn Leu Lys Ser Tyr Arg Arg De Thr Ser Ser Arg Cys Pro Tip Glu Ala 
Val lie Phe Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gin Lys Trp Val 
Glu Glu Ala lie Ala Tyr Leu Asp Met Lys Thr Pro Thr Pro Lys Pro 
25 encoded by the cDNA sequence (SEQ ID NO: 39) of: 

1 ACTGAAGCCA GCTCTCTCAC TCTCTTTCTC CACCATGAGG ATCTCTGCCA 
51 CGCTTCTGTG CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG 
101 GCCCAACCAG ATGGGCCCAA TGCATCCACA TGCTGCTATG TCAAGAAACA 
151 AAAGATCCCC AAGAGGAATC TCAAGAGCTA CAGAAGGATC ACCAGTAGTC 
30 201 GGTGTCCCTG GGAAGCTGTT ATCTTCAAGA CAAAGAAGGG CATGGAAGTC 
251 TGTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 
301 CATGAAAACC CCAACTCCAA AGCCTTGAAG AAATGTGCCT GAACAGAAAC 
351 CAACCTAGGA GCCAAGAAGC AAAAATTCCT CACCGCTGTT CTTTCTGAGA 
401 ACTGTTGATG AAATGTGTTG ATCACGGTCC TAAGGGATAG GAGCTGTCTG 
35 451 TAGGAATGTG AAACAGTCAC GCCTAAGGAA TGGTCTTTAA GTTATTAATA 
501 TTTTTATTTA ATTAGCCATG TACTTTGGTG TGATTTGAAT GTAAAGCTCT 
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551 GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC 
which corresponds to the cDNA sequence and derived amino acid sequence human fic (growth 
factor-activated gene). See Heinrich et al„ Molecular and Cellular Biology 13: 2020-2030, 
1993. 

5 In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 40) of: 

Asp Ser Val Ser lie Phe lie Thr Cys Cys Phe Asn Val lie Asn Arg Lys lie Pro He Gin 
Arg Leu Glu Ser Tyr Thr Arg He Thr Asn He Gin Cys Pro Lys Glu Ala Val He Phe Lys 
Thr Gly Lys Glu Val Cys Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His 

10 Lys Asp Gin He Phe Gin Asn Leu Lys Pro 

which corresponds to the cDNA sequence and derived amino acid sequence monocyte 
chemoattractant protein 2 (MCP-2). See VanDarnme et al., J. Exp. Med. 176: 59 - 65, 1992. 

The purified heparanase of the present invention, allows for the convenient selection of 
compounds having anti-heparanase activity (AHA compounds), i.e. inhibitors of heparanase 

15 activity (IHA), by measuring inhibition of heparanase activity. Inhibition of heparanase activity 
can be measured utilizing in vivo radiolabeled heparan sulfate/heparin. This ligand is 
radiolabeled to high specific activity by intraperitoneal injection of 0.5mCi of S-35 sulfate into 
C57 mice bearing a 1-2 cm basement membrane tumor (EHS; Engelbreth, Holm, Swarm tumor). 
The tumor is harvested after 16 hours and the heparan sulfate proteoglycan extracted in 4 

20 volumes of 6M urea, 20mM Tris pH 6.8, protease inhibitors, 0.1 5M NaCl and 0.5% triton X- 
100. The urea extract is chromatographed on an anion exchange column and the proteoglycan is 
eluted in a linear gradient of NaCL Hie radiolabeled proteoglycan is exchanged into a solution 
of 4.0M guanidine-HCI, 20mM Tris pH 7.4 and applied to a size exclusion column. The 
proteoglycan peak is pooled and exchanged into 0.1 5mM NaCl and 20mM Tris pH7.4. 

25 Purified, radiolabeled proteoglycan is coupled to commercially available agarose support 

A quantitative assay of heparanase activity is constructed with the radiolabeled ligand in a multi- 
well format. Briefly, known quantities of recombinant heparanase are added to a multi-well 
plate containing equal amounts of radiolabeled ligand in each well. Enzyme-ligand interaction 
proceeds overnight and the ligand-agarose complex is recovered by cantrifugation. Radioactivity 

30 in the liquid phase is determined by scintillation counting and is the measure of enzyme activity. 
Potential enzyme inhibitors can be evaluated by adding the compound to the solution phase or 
alternatively adding the assay components to multi-weU plates containing preweighed amounts 
of test compound. 

In addition, the purified heparanase of the subject invention can be used for therapeutic 
35 wound healing or can be immobilized onto filters and used to degrade heparin from the blood of 
patients post-surgery. 
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Wound treatment can be achieved by administration to an afflicted individual an 
effective amount of a pharmaceutical composition comprising the purified heparanase in 
combination with a pharmaceutically acceptable, preferably slow releasing, carrier. See. e.g. 
PCT/US90/04772, incorporated herein by reference. 
5 Immobilization onto filters can be achieved by the methods well known in the art 

including those disclosed by. Langer et al. in Biomaterials: Inter-facial Phenomenon and 
Applications, eds. Cooper et al, pp 493-509, 1982 and those described in U.S. Patent No. 
4,373,023, 4,863,611 and 5,211,850 (all incorporated herein by reference). 

The purified heparanase of the subject invention can be prepared by the method 
10 described in procedure A or procedure B, but preferably procedure A. 

PROCEDURE A 

Reverse transcription of the mRNA from activated human leukocyte-derived cells 
[preferably lymphocytes, neutrophils, platelets, Jurkatt lymphoma cells, Dami cells (Greenberg et 
al.. Blood 72:1968-1977, (1988)] is used to prepare the cDNA for the desired heparanase 

15 enzyme (preferably SEQ. ID. NO: 1; optionally SEQ* ID. NO: 3, SEQ. ID. NO: 5, SEQ. ID. 
NO: 7; SEQ. ID. NO: 13, SEQ. ID. NO: 15, SEQ. ID. NO: 17, SEQ. ID. NO: 19, SEQ. ID. 
NO: 21, SEQ. ID. NO: 23. SEQ. ID. NO: 25, SEQ. ID. NO: 27, SEQ. ID. NO: 29; SEQ. ID. 
NO: 31, SEQ. ID. NOs 33 or SEQ. ID. NO: 35), employing standard PCR cloning techniques 
(described in Sambrook et al., in: Molecular Cloning, A Laboratory Manual Second Edition, 

20 1989. Cold Spring Harbor Press). The cDNA encoding the heparanase enzyme is cloned into 
Xbal/BamHl sites in the commercially available baculovirus vector pVL 1392 (Pharmingen; 
San Diego, CA). High titer infectious virus is selected for use in infecting sf9 insect cells 
(Luckow and Summers, Bio/Technology. 6,47 1988). Serum-free medium conditioned by 
infected sfP cells is collected after 72 hours. This media is the starting material for purification 

25 of recombinant heparanase. Serum-free conditioned media is adjusted to contain 20mM Sodium 
Acetate, pH 5.0, 0.15M NaCl, ImM reduced glutathione (GSH), ImM dithiothrcitol (DTT) and 
lOmM beta-octylglucoside. Medium is applied to a column of cation-exchange resin 
(Pharmacia) and eluted from the column in a linear gradient of NaCl. Fractions containing 
heparanase are pooled and diluted to a final salt concentration of 0.1 5M NaCl. To this solution 

30 is added 20mM Tris and the pH adjusted to 7.0. The solution is applied to a column of 

heparin-Sepharose (Pharmacia) and eluted with a linear salt gradient buffered to pH 5.0 with 
20mM Sodium Acetate. Heparanase is concentrated to 0.5mg/ml in an Amicon concentrator 
fitted with a YM-2 membrane and stored at -80 degrees. For optimal activity (greater than 50 
units heparanase activity per u% protein) incubation in the presence of transglutaminase, under 

35 reducing conditions, in accordance with the procedure in Example 2, Part C. 
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PROCEDUREB 

This procedure describes the purification to homogeneity of heparanase (SEQ. ID. NO: 
1) from human blood cells or cell lines (such as platelets) under reducing conditions which 
allow for the occurrence of post-translational modifications that increase the specific activity of 

5 heparanase and make it suitable for use in the above described screening assay. The cells are 
treated with a suitable activator (such as, but not limited to, thrombin or histamine) which 
allows for the release of enzymes and cytokines from the cell. Reducing agents are added to the 
supernatant from the activated cells. Suitable reducing agents include dithiothreitol (DTT), 
dithioerythritol (DTE), reduced glutathione (GSH), and P-mercaptoethanol. The reduced, 

10 activated supernatant is chromatographed on a column of immobilized heparin or heparan sulfate 
under reducing conditions at pH 5, using a salt gradient (such as NaCl, KC1, or other salt) to 
elute the bound proteins. Fractions containing heparanase activity are pooled and exchanged 
into any buffer appropriate for the pH of 6.8 and containing 0.15 M NaCl, reducing agents, and 
non-ionic detergent. This is passed over any suitable anion-exchange column (bed volume of 5 

15 ml or less). The unbound material from this column is adjusted to pH 5 with acid, and is 

loaded onto any suitable cation-exchange column (bed volume of 5 ml or less), equilibrated in a 
suitable pH 5 buffer containing 0.15 M NaCl, reducing agents, and non-ionic detergents. The 
bound protein is eluted from the column with a salt gradient, and the fractions containing 
heparanase activity are pooled and size fractionated to below 30,000 daltons with 30 K-cut-off 

20 membranes. The protein below 30,000 daltons is concentrated by either heparin-sepharose 
chromatography or by centrifugation through 5 K-cut-off membranes. 

The present invention is seen more fully by the examples set forth below. 
Example 1: Use of Heparanase as a screen for AHA compounds. 

1. Heparan sulfate, metabolically labeled (S-35) to a high-specific activity- as described 
25 above for the EHS tumor, prepared by papain digestion of chromatographically purified heparan 

sulfate proteoglycan is coupled to cyanogen bromide activated Sepharose-6B (Pharmacia) 
according to manufacturer's instructions. 

2. 35 S-Heparan sulfate-Sepharose 6B is resuspended in: 0.15 M NaCl, 0.03% human 
serum albumin, 10 pM MgCl 2 , 10 pM CaCl 2 , antiproteolytic agents (1 pg/ml leupeptin, 2 pg/ml 

30 antipain, 10 pg/ml benzamidine, 10 units/ml aprotinin, 1 pg/ml chymostatin, and 1 pg/ml 
pepstatin), and 0.05 M Na acetate, pH 5.6 and 5,000 cpm, in a total volume of 200 pi, are 
aliquoted into each well of a 96 well plate. To each well is added 5 units of activated 
heparanase and the digestion allowed to proceed overnight at 37 degrees. 

3. Separation of digested product is accomplished by centrifugation of the 96 well 
35 plate. The supernatant, containing cleaved heparan sulfate is decanted and quantitated by 

scintillation counting. 
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4. Inhibitors of heparanase activity can be introduced into the liquid-phase of the 

assay. 

5. A potential inhibitor of heparanase activity would be identified by its ability to 
reduce the amount of radiolabeled heparan sulfate released into the supernatant by 50% at a 

5 concentration of 1 pM or less. 

Example 2: The preparation of heparanase under reducing conditions as outlined in 

Procedure B. 

Part A: 

Platelet-rich plasma (lti 9 platelets/ml; 1800 ml) is obtained from healthy, informed 
10 volunteers by plasmapheresis. The plasma is removed from the platelets by centrifugation 

(Heldin, et al, Exp. Cell Res. 109: 429-437, 1977). Platelets suspended in phosphate buffered 
saline (PBS; 0.1 original volume) are then stimulated with 1 U/ml thrombin for 5 min at 37°C. 
This concentration of thrombin was reported to release 100% of the heparanase activity from 
platelets (Oldberg, et al, Biochemistry 19: 5755-5762, 1980). Following activation, the 
15 thrombin is inactivated by the addition of 100 mM phenylmethylsulfooylfluoride (PMSF), and 
the platelets are centrifuged at 2000 x g for 30 min at 4°C. The supernatant is stored at -80°C 
until used for the chromatographic purification of heparanase (Part B). 
Part B: Chromatographic purification of heparanase. 

1. Heparin-Sepharose Chromatography. Activated platelet supematants are pooled and 
20 adjusted to contain 1 mM GSH and 1 mM DTT. This pool is loaded (2.5 ml/min) onto a 

column of heparin-sepharose (2.6 x 7.5 cm, 40 ml) equilibrated in 1 mM GSH, 1 mM DTT, 150 
mM NaCl, 10 mM NaP0 4 , pH 7.4. After loading the sample, the column is washed with 200 
ml of 0.15 M NaCl, 1 mM GSH, 1 mM DTT, 10 mM Na acetate, pH 5, followed by 60 ml of 
0.35 M NaCl, 1 mM DTT, 1 mM GSH, 10 mM Na acetate, pH 5. The column is then eluted 
25 with a 160 ml linear gradient between 0.35 M NaCl and 1.5 M NaCl in the same buffer. 

Aliquots of each fraction are used for determination of heparanase activity by the ''Purification 
Assay" described later. 

2. Anion-exchange chromatography (For example, DEAE-Sephacel, Pharmacia). The 0.9 
M - 1.15 M NaCl fractions from the heparin-sepharose column are concentrated using a stirred 

30 cell fitted with a PM-10 membrane, and the buffer is exchanged to 0.15 M NaCl, 1 mM DTT, 1 
mM GSH, 10 mM 0-octylglucoside, 10 mM sodium phosphate, pH 6.8 (8 ml). This sample is 
loaded onto a 5 ml column of DEAE-Sephacel equilibrated in the same buffer. After loading, 
the column is washed to baseline absorbance (280 nm) with the equilibration buffer. The flow- 
through and wash with equilibration buffer are collected as one pool. The column is then eluted 

35 with 10 ml of 0.15 M NaCl, 10 mM P-octylglucoside, 1 mM GSH. 1 mM DTT, 10 mM Na 
acetate, pH 5. followed by 10 ml of 1.5 M NaCl, 10 mM fl-octyiglucoside. 1 mM GSH, 1 mM 
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DTT, 10 mM Na acetate, pH 5. Aliquots of each pool are used for determination of heparanase 
activity by the ,r Purification Assay". 

3. Cation Exchange. The unbound sample from the DEAE-Sephacel column is adjusted to 
pH 5 with glacial acetic acid and loaded onto a cation exchange column (Poros HS/F, 4.6 mm x 

5 50 mm; PexSeptive Biosystems), pre-equikbrated with 0.15 M NaCl, 1 mM DTT, 1 mM GSH, 
10 mM [J-octylglucoside, 10 mM Na acetate, pH 5. The flow rate is 3.0 ml/min and 3 ml 
fractions are collected. After washing the column with 35 ml of equilibration buffer, the column 
is developed with a 55 ml linear gradient between 0.15 M and 1.5 M NaCl in the same buffer. 
10 pi aliquots of the fractions are used for determination of heparanase activity by the 
10 ,l Purification Assay". 

4. Size fractionation to < 30 kD and concentration on immobilized heparin (Hi-trap 
heparin-sepharose, Pharmacia). The activity from the Poros HS/F column is size fractionated by 
centrifuging through 30,000 molecular weight cut-off filters (Millipore ultrafree-MC 30,000 
NMWL filter units). The < 30 kD pool is diluted to contain 0.15 M NaCl, and is loaded onto a 

15 1 ml Hi-trap heparin column, pre-equilibrated with 0.15 M NaCl, 1 mM DTT, 1 mM GSH, 10 
mM Na acetate, pH 5. The column is eluted with 1.2 M NaCl in the same buffer and the single 
eluted peak contains the heparanase activity. 

Part C: Properties of the purified heparanase. 

The final yield of heparanase protein from 1850 ml platelet-rich plasma was 2.7 mg. 

20 Protein concentration was determined by the method of Lowry (/. Biol Chem. 193: 265-275, 
1951), or if more precise determinations were required, by amino acid analysis on an amino acid 
analyzer (Beckman 6300). The overall recovery of activity was 8%, with a 4150-fold 
purification. The preparation was judged to be homogeneous by the presence of a single band 
of 9000 daltons on an 18% silver-stained SDS-polyacrylamide gel, run according to the method 

25 ofLaemmli(Afofi<re 227: 680-685,1970). 

The pH optimum of the purified heparanase was determined by conducting the 
•■Purification assay" activity between pH 3.5 and 8.0, using a citrate buffer (pH 3.5 - 6.0), 
citrate-phosphate buffer (pH 6.5 - 7.0), and phosphate buffer (pH 7.5 - 8.). Heparanase was 
active between pH 5.0 and 8.0, with the optimum pH at 5.8. 

30 N-terminal amino acid sequencing of heparanase produced by this procedure was 

performed using a gas/liquid phase Protein Sequencer (Applied Biosystems Inc. Model 470). 
Phenylthiodantoin amino acids were resolved and quantitated by an on-line HPLC system 
(Model 120, Applied Biosystems Inc.) with data analysis on a Nelson Analytical System. N- 
terminal amino acid sequences of the heparanase produced in this example were 85 % 

35 SEQ. ID. NO: 9 (namely: 



WO 95/04158 



PCT/US94/08207 



-25- 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 
1 5 10 15 

Glu Leu Arg)» 

which is identical to CTAP-m, and 15% SEQ. ID. NO: 10 (namely: 

5 Ser Sex Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu), 
1 5 10 15 

which is the precursor form, platelet basic protein. Interestingly, the N-terminal sequence of 
commercial P-thromboglobulin (namely, Calbiochem (Cat. # 605165), Celsus Laboratories (Cat 
# 41705), and Haematologic Technologies (Cat. # HBTG-0210), which has low levels of 
10 heparanase activity, was 100% SEQ. ID. NO: 11 (namely: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu), 

1 .inuS 10 J5 

indicating thata&e commercial preparation is actually CTAP-m and not P-thromboglobulin. 

Chromatofocusing of the heparanase produced by this procedure results in two peaks of 
15 differing isoelectric points. To perform the chromatofocusing, heparanase is dissolved in 0.025 
M imidazole, pH 7.3. The sample is loaded onto a OS x 20 cm column of Polybuffer 
Exchanger 94 (Pharmacia), equilibrated with 0.025 M imidazole, pH 7.3. Immediately after 
sample loading, Polybuffer 74 (Pharmacia; 1 :8, pH 4) is pumped onto the column at 03 ml/min. 

2 ml fractions are collected, and the pH of each fraction is determined by a narrow range (pH 4- 
20 7) pH paper. Aliquots of each fraction are used to determine heparanase activity by the 

"Purification Assay. 1 * All of the activity is associated with an absorbance (280 nm) peak that 
eluted at pH 4.8 to 5.1, representing approximately 10% of total protein, while 90% of the 
protein is eluted at pH 7.3 and is inactive. Aliquots of each protein peak are separated from the 
ampholytes by C 4 reverse phase chromatography. The peak that eluted from the chrornato- 

25 focusing column at pH 7.3 has terminal sequences for platelet basic protein and the processed 
form, CTAP-m. The peak that is eluted from the chromatofocusing column at pH 4.8 - 5.1 also 
contains the sequences of platelet basic protein and the processed form, CTAP-m. All of the 
platelet basic protein processed forms have pl*s that are calculated and reported to be greater 
than 7.6. Thus, the heparanase activity resides in the platelet basic protein and/or the processed 

30 form, CTAP-m that is modified such that the pi is lowered to 4.8 - 5.1. 

Heparanase obtained after diromatofocusing exhibited a specific activity of 80 unitr \ig 
protein, using the "Purification Assay." This represents a 1000-fold increase in the specific 
activity compared to the commercial protein (f^thromboglobulin; 0.075 units/ug protein). 
The modification that may be responsible for the lower isoelectric point of active 

35 heparanase is ADP-ribosylation. ADP-ribosylation (Adenine diphosphate-ribosylation) is a post- 
translational modification of proteins or DNA in which the ADP-ribose group of NAD 
(Nicotinamide adenine dinucleotide) is enzymatically transferred to proteins or DNA. Since this 
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modification adds two negatively charged phosphate groups to a molecule, it would result in a 
lower isoelectric point. Activated platelet supernatants were incubated in 1 mM DTT, 2 mM 
MgCl 2 * 100 mM HEPES, pH 7.4, and 0.5 uM [ 32 P]NAD (Specific activity = 1000 Ci/mmol). 
The labeled proteins were separated by SDS-polyacrylamide gel electrophoresis on an 18% gel, 
5 transferred to PVDF (polyvinylidene difluoride) membrane, and exposed to X-ray film. The 
autoradiogram demonstrates the incorporation of [ 32 P] into a protein of 8000 daltons. The 
PVDF membrane was immunoblotted with the anti-Peptide C antisera (1:1500 in PBS 
containing 5% dry milk, 0.05% Tween-20, 0.15 M NaQ, 20 mM Tris, pH 7.4, 2 hours at room 
temperature, followed by incubation with peroxidase-labeled goat anti-chicken IgG (1:500 in 

10 above buffer, 1 hour room temperature), and reacted with a peroxidase substrate. The 

immunoblot revealed that the 8000 dalton that was labeled with [ 32 P] was CTAP-m/heparanase. 
The addition of 200 uM sodium nitroprusside, a spontaneous releaser of nitric oxide, to the 
ADP-ribosylation reaction resulted in 5-fold more incorporation of [ 32 P] label into CTAP- 
m/heparanase, suggesting that this modification can be regulated in vivo by nitric oxide. 

15 Finally, in an an algous manner to that of glyceraldehyde-3-phosphate dehydrogenase, another 
platelet ADP-ribosylated glycolytic enzyme (Zhang and Snyder, Proc. Natl Acad. ScL USA 89: 
9382-9385), it was determined that CTAP-m/heparanase has an auto- ADP-ribosylation activity, 
since the [ 32 P]-ADP-ribosylation of CTAP-m/heparanase occurs in reactions where the only 
protein present is commercial CTAP-m or purified heparanase. Other chemokine family 

20 members tested, which includes DM-, IL-8, gro-a, and MCAF, also have auto- ADP-ribosylation 
activity. 

It is contemplated that the high specific activity of CTAP-m/heparanase is a 
consequence of ADP-ribosylation of the enzyme in the presence of nitric oxide. It is further 
contemplated that the action of transglutaminase on the ADP-ribosylated enzyme will lead to 

25 further increase in the specific activity. 

An amino acid composition of the heparanase produced in Example 2 gave the expected 
amino acid composition for CTAP-m and N-terminal sequencing revealed sequences for platelet 
basic protein and the processed form, CTAP-m , confirming that the heparanase activity is 
contained in this set of processed proteins and is not due to a minor contaminant. The presence 

30 of heparanase activity in three commercial sources of P-thromboglobulin also confirms this 
conclusion. In addition, polyclonal antibodies to P-thromboglobulin were found to precipitate 
30 - 70% of the heparanase activity in three separate experiments, providing additional 
confirmation. 

The activation of heparanase with transglutaminase (prepared in accordance with 
35 Example 2, Part B) results in a substantial (about 13 -fold) increase in the specific activity of the 
enzyme. The heparanase (2 u\ at 56 nM) obtained by Example 2, Part B is treated with either 
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transglutaminase from guinea pig liver (4 mU; Sigma) or with Factor XIII (1 ug; Celsus 
Laboratories, Inc.). the blood coagulation factor that is activated by treatment with 5 units of 
thrombin at 37 degrees for 30 minutes. Heparanase is activated by incubation of either 2mU 
liver transglutaminase or 5 units of activated Factor XTQ in the presence of 0.1M NaAcetate 
5 buffer at pH 6.0 containing lmM reduced glutathione and ImM CaCl for 35 minutes at 37 
degrees. Treatment of heparanase with either type of transglutarninase results in a substantial 
increase in the specific activity of the heparanase. 

The high degree of sequence identity between CTAP-m and lnterleukin-8, a CXC 
chemokine family member, assures that an essentially identical folding pattern will be shared by 

10 the two proteins. Since the 3-dimensional structure of Interleukin-8 is known (Clore, et al., 
Biochemistry 29: 1689-1696,1990; Baldwin, et aL, J. Biol Chem. 265: 6851-6853), one can 
model the same for CTAP-m. Such a model can serve to direct n • -arch into rationally 
designed IHA and to help explain the action of transglutaminase in activating the CTAP-HL 
Part D: Purification Assay for Heparanase Activity 

15 Heparanase activity from platelets or column fractions is detected by its ability to digest 

the £ 70 kD 35 S-HSPG to produce lower molecular weight products. Each digest contains 10 ul 
sample, 35 S-HSPG (2000 cpm), 0.15 M Nad, 0.03% human serum albumin, 10 uM MgCl 2 , 10 
uM CaCl 2 , antiproteolytic agents (1 ug/ml leupeptin, 2 ug/ml antipain, 10 ug/ml benzamidine, 
10 units/ml aprotinin, 1 ug/ml chymostatin, and 1 ug/ml pepstatin), and 0.05 M Na acetate, pH 

20 5.6 in a total volume of 300 pi. Digests are carried out for 3 to 21 h. The presence of lower 
molecular weight radiolabeled products is detected by centrifugation through 30,000 MW-cutoff 
filters. The digests containing 2000 cpm of 35 S-HSPG (> 70 K) are centrifuged through 30,000 
molecular weight cut-off filters (Millipore Ultrafree-MC 30,000 NMWL filter units). 35 S-HSPG 
degradation is evident by the presence of radioactivity in the filtrate that passed through the 30 

25 K membrane; this heparanase activity is expressed as the % of total cpm < 30 K for a given 
digest Analysis of heparan sulfate degradation by this method is quick and reproducible. 1 
unit of heparanase activity is defined as 1% cpm < 30 K per h. For pH optimum deterrnination, 
the 0.1 M Na acetate buffer is replaced by 50 mM citrate, citrate-phosphate, or phosphate buffer 
at varying pH's. For samples from chromatographic steps performed under reducing conditions 

30 (1 mM GSH, 1 mM DTT), the concentration of a thiol oxidant (diamide) needed for optimum 
activity is determined. This concentration (100 pM diamide) is added to all assay tubes when 
reduced samples are assayed. 

Preparation of 3S S-HSPG (>70 K) for use in the 'Purification Assay." 

35 S-HSPG (>70 K) is prepared from mice bearing a basement membrane rumor that 
35 overproduces HSPG (EHS tumor), using modifications of the method of Ledbetter, et. al., 1987. 
Briefly, the radiolabeled HSPG was prepared by injecting C57BL mice bearing the EHS tumor 
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(Orkin, et.al., 1977) with sodium [ 35 S]sulfate (0.5 raCi/mouse) 18 h before harvesting the tumor. 
The HSPG is extracted from the weighed tumor with 6 volumes (w/v) of Buffer A (3.4 M NaCl, 
0.1 M 6-aminohexanoic acid t 0.04 M EDTA, 0.008 M ^-ethylmaleimide, 0.002 M PMSF, and 
0.05 M Tris-HCl, pH 6.8), by homogenization with a Polytron for 30 s, foUowed by stiiring at 
5 4°C for 1 h. Insoluble material is collected by centrifugation (12,000 x g for 10 ruin), and the 
supernatant is discarded The insoluble residue is reextracted with 2 volumes (original tumor 
weight) of Buffer A for 30 min with stirring at 4°C. Insoluble material is again collected by 
centrifugation, and the supernatant fraction is discarded. The insoluble material is then 
suspended in 6 volumes of Buffer B (6 M urea, 0.1 M 6-aminohexanoic acid, 0.04 M 

10 emylenediaminetetraacetic acid (EDTA), 0.002 M PMSF, and 0.05 M Tris-HCl, pH 6.8), 

homogenized with an electric homogenizer (Polytron) for 30 s, and stirred for 2 h at 4°C. The 
mixture is centrifuged to remove insoluble material, and the supernatant is retained. The 
insoluble material is reextracted with 2 volumes of Buffer B. The mixture is centrifuged, and 
the supernatant is combined with the previous supernatant. 

15 35 S-HSPG is isolated from the Buffer B supernatant by sequential chromatography on 

anion exchange and gel filtration columns. The Buffer B supernatant is dialyzed overnight 
against 10 volumes of 6 M urea, 0.15 M Nad, 0.05 M Tris-HCl, pH 6.8, and is adjusted to 
contain 0.5% non-ionic detergent (Triton X-100). This supernatant (from 1 1 g tumor) is 
chromatographed on a 30 ml column of anion exchange resin (DEAE-Sephacel) equilibrated 

20 with 6 M urea, 0.15 M NaCl, 0.05% Triton X-100, 0.05 M Tris-HCl, pH 6.8. After loading the 
supernatant and washing with the equilibration buffer, the column is developed with a 250 ml 
linear gradient between 0.15 M NaCl and 1.15 M NaCl (Flow - 2.0 ml/min). Fractions are 
sampled for radioactivity, and those containing the 35 S0 4 label that elutes from the DEAE- 
Sephacel between 0.4 M and 0.8 M NaCl are pooled. The proteoglycan is precipitated by the 

25 addition of 4 volumes of 100% EtOH at -20°C overnight The precipitate is collected by 

centrifugation and is solubilized in 1 ml of Buffer C (4 M Gu-HCl, 20 mM Tris-HCl, pH 7.2). 
This solubilized pellet is used for chromatography on a calibrated gel filtration column (1.0 x 50 
cm column of Superose 6; Pharmacia) equilibrated in Buffer C (Flow « 05 ml/min). Fractions 
are sampled for radioactivity, and those containing the 3S S0 4 label that elutes with a molecular 

30 weight £ 70 kD were pooled. The proteoglycan is precipitated with 100% EtOH as described 
above. The pellet is dissolved in 3 ml PBS, and dialyzed against 3 x 100 volumes of PBS. 
Each preparation of 35 S-HSPG is confirmed to be > 98% heparan sulfate by susceptibility to 
low pH nitrous acid degradation (Shiveley and Conrad, Biochemistry 15: 3932-3942, 1976). 
Example 3: Preparation of cDNA encoding Heparanase. 

35 Media is removed from cultured HEL (HEL 92.1.7; Human erythroleukemia; ATCC No. 

TIB 180) cells stimulated with lOnM phorbol 12-myristate 13 -acetate (Sigma Chemical Co., St 
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Louis, MO) and the cells scraped from the dish and pelleted by centrifugation. The pellet is 
extracted with 200 wl of TRI reagent (Molecular Research Center Inc. Cincinnati, OH) and the 
total cellular RNA is prepared according to the manufacturer's instructions. To prepare first 
strand synthesis the reverse transcriptase reaction was performed with lOwl of total cellular RNA 

5 in the presence of 4id of 5x transcriptase buffer (Bethesda Research Laboratories, Gaithersburg, 
MD), lul 0.2mM DTT, 4u\ random hexanucleotides (Amersham Corp. Arlington Heights, ILL), 
and lid lOmM dNTP (BRL). This solution is heated to 95 degrees C f or 5 minutes and then 
placed on ice. To this is added lid RNAsin and lid reverse transcriptase (M-MLV), (Promega, 
Madison WI). This is incubated at 37 degrees for 60 minutes and then placed on ice. The 

10 polymerase chain reaction is carried out as follows. To 3«1 of the first strand (above) is added 
lid of each Primer (see below), 77«1 of water lOwl lOx PCR buffer (Perkin Elmer Cetus, 
Norwalk CT) and 2u\ each dNTP. This solution is heat denatured at 95 degrees C and lid 
Amplitaq DNA polymerase (Perkin Elmer Cetus) is added. Hybridization temperature begins at 
72 degrees and is lowered by one degree per cycle until reaching 55 degrees. Each 

15 hybridization step is followed with a constant elongation temperature of 72 degrees. Upon 
completion the solution is left at 0 degrees until storage at -20 degrees. The products of the 
PCR reaction are electrophoresed on 3% NuSieve, 1% agarose gels and bands of expected size 
are excised and purified by standard procedures. 
Primers: 

20 Platelet Basic Protein: TGG ACT ACT ATG TCC TCC ACC AAA GGA CAA ACT AA 
CTAP m: TOG ACT AGT ATG AAC TTG GOG AAA GAG GA 
B-thrombglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG 
NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA 
Example 4: Preparation of cDNA encoding Heparanase. 

25 Media is removed from cultured leukocyte-derived cells [e.g., lymphocytes, neutrophils, 

platelets, Jurkatt lymphoma cells, Dami cells (Greenberg et al., Blood 72:1968-1977, (1988)], 
stimulated with Concanavalin A or phorbol 12-myristate 13 acetate (Sigma Chemical Co., St. 
Louis, MO) and the cells scraped from the dish and pelleted by centrifugation. The pellet is 
extracted with 200«l of TRI reagent (Molecular Research Center Inc. Cincinnati, OH) and the 

30 total cellular RNA is prepared according to the manufacturer's instructions. To prepare first 

strand synthesis the reverse transcriptase reaction was performed with lOid of total cellular RNA 
in the presence of 4«1 of 5x transcriptase buffer (Bethesda Research Laboratories, Gaithersburg, 
MD), lid 0.2mM DTT, 4id random hexanucleotides (Amersham Corp. Arlington Heights, ILL), 
and lid lOmM dNTP (BRL). This solution is heated to 95 degrees C for 5 minutes and then 

35 placed on ice. To this is added lid RNAsin and lid reverse transcriptase (M-MLV), (Promega, 
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Madison WI). This is incubated at 37 degrees for 60 minutes and then placed on ice. The 
polymerase chain reaction is carried out as follows. To 3«I of the first strand (above) is added 
lal of each Primer (see below), llu\ of water lOwl lOx PCR buffer (Perkin Elmer Cetus, 
Norwalk CT) and 2ul each dNTP. This solution is heat denatured at 95 degrees C and lid 
5 Amplitaq DNA polymerase (Perkin Elmer Cetus) is added. Hybridization temperature begins at 
72 degrees and is lowered by one degree per cycle until reaching 55 degrees. Each 
hybridization step is followed with a constant elongation temperature of 72 degrees. Upon 
completion the solution is left at 0 degrees until storage at -20 degrees. The products of the 
PCR reaction are electrophoresed on 3% NuSieve, \% agarose gels and bands of expected size 
10 are excised and purified by standard procedures. 
Primers: 

Platelet Basic Protein: TGG ACT AGT ATG TCC TCC ACC AAA GGA CAA ACT AA 

CTAP IE: TGG ACT AGT ATG AAC TTG GCG AAA GAG GA 

B-thrombglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG 

15 NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA 

All temperatures expressed throughout the subject specification are in degrees Centigrade. 

The cDNA encoding heparanase is preferably cloned into a vector designed for 
expression in eukaryotic cells, rather than into a vector designed for expression in prokaryotic 
cells (e.g. E. coli). Eukaryotic cells are preferred for expression of genes obtained from higher 

20 eukaryotes because the signals for synthesis, processing, and secretion of these proteins are 
usually recognized, whereas this is often not true for prokaryotic hosts (Ausubel, et aL, ed., in 
Short Protocols in Molecular Biology, 2nd edition, John Wiley & Sons, publishers, pg.16-49, 
1992.). Eukaryotic hosts may include, but are not limited to, the following: insect cells, 
African green monkey kidney cells (COS cells), Chinese hamster ovary cells (CHO cells), and 

25 Murine 3T3 fibroblasts. 

Experiments demonstrating that a synthetic peptide of CTAP-m/NAP-2 or antisera 
raised against a synthetic peptide of CTAP-m/NAP-2 inhibit the heparanase activity of CTAP- 
m/NAP-2 suggest that the amino acids participating in enzymatic catalysis are contained in a C- 
terminal region of the enzyme. 

30 Peptide Synthesis: A C-terminal peptide contained within the sequences known for CTAP-m 
(SEQ ID NO: 1), Platelet Basic Protein (SEQ ID NO: 3), P-thromboglobulin (SEQ ID NO: 5), 
and NAP-2 (SEQ ID NO: 7), and an N-terminal peptide contained within the sequences known 
for CTAP-m (SEQ ID NO: 1) and Platelet Basic Protein (SEQ ID NO: 3) were synthesized 
according to standard procedures. The N-terminal peptide has the following sequence (SEQ ID 

35 NO: 41: Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Cys, in which the final 
Cys residue was added to regions of known sequence (SEQ ID NOS: 13) for the purpose of 
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conjugation to a carrier protein. The C-terminal peptide has the following sequence (SEQ ID 
NO: 42): Cys Asn Gin Val Glu Val He Ala Thr Leu Lys Asp Gly Arg Lys lie Cys Leu Asp 
Pro Asp Ala Pro Arg lie Lys Lys lie Val Gin Lys Lys encoded by the cDNA sequence (SEQ 
ID NO: 43) of TGCAACCAAG TCGAAOTGAT AGCCACACTG AAGGATGGGA 

5 GGAAAATCTG CCTGGACCCA GATGCTCCCA GAATCAAGAA AATTGTACAG 

AAAAAA. These peptides (SEQ ID NOS: 41 and 42) were produced by stepwise solid phase 
peptide synthesis on an Applied Biosystems 430A Peptide Synthesizer. The 9- 
fluoroenylmethyloxycarbonyl (Fmoc) group was used as the amino protecting group, and 
temporary side-chain protectin groups were as follows: Arg (Prac), Asn (Trt), Asp (OtBu), Gin 

10 (Trt), Glu (OtBu), His (Trt), Lys (Boc), Ser (tBu), Thr (tBu). Each residue was single coupled 
using a HBTU/NMP protocol and capped with acetic anhydride before the next synthesis cycle. 
After removal of the N-terminal Fmoc group, temporary side-chain protecting groups were 
removed and the peptide cleaved from the resin by treatment with 95% TFA/5% scavengers 
(ethyl methyl sulfide/anis. -i 1,2-ethanedithiol, 1:3:1) for two hours at room temperature. The 

15 crude peptides were precipitated from the cleavage solution with cold diethyl ether. The 
precipitated peptide was collected on a sintered glass funnel, washed with diethyl ether, 
dissolved in dilute acetic acid, evaporated to dryness under reduced pressure, and the residue 
was redissolved and lyophillized from glacial acetic acid. The crude peptides were purified by 
preparative reverse phase chromatography on a Phenomenex C-18 column (22.5 x 250 mm) 

20 using a water/acetonitrile gradient, each phase containing 0.1% trifluoracetic acid (TFA). Clean 
fractions, as determined by analytical HPLC, were pooled, the acetonitrile was evaporated under 
reduced pressure, and the aqueous solution was lyophillized. The purified peptides were 
characterized by time of flight or FAB mass spectroscopy. 

Further, SEQ ID NO: 42 can be produced by recombinant DNA methodology as stated 

25 in Procedure A (page 21). 

Antisera Production: The synthetic peptides of CTAP-III/NAP-2 were conjugated to keyhole 
limpet hemocyanin utilizing a maleimide-activated carrier protein (Pierce Chemical Co. #77107). 
300 pg of conjugated peptides were injected into chickens using Freunds complete adjuvant 
The antisera were collected 5 weeks after initial immunization. Specific recognition by the 

30 antisera of commercial CTAP-m (2.5 pg, (Celsus Laboratories Inc., Cincinnati, Ohio; Cat #. 
41705), isolated heparanase (1.5 pg), and 10 ul of the platelet supernatant used for purification 
was achieved by separating the proteins on a reducing 18% polyacrylamide gel (Novex), 
transferring to nitrocellulose, and incubating with the pre-immune or antisera (1:1500), followed 
by incubation with a peroxidase labeled goat anti-chicken IgG (1:500; Kierkegaard and Perry) in 

35 the presence of PBS containing 5% dry milk and 0.05% Tween-20. The pre-immune sera did 
not recognize 7 - 10 kD proteins in the commercial CTAP-m, isolated heparanase, or platelet 
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supematants. 

TnMhftinn of heparanase activity by th e C-terminal synthetic peptide (SEQ ID NO: 42) or 
antisera: For experiments designed to determine whether the peptide antisera was able to 
inhibit heparanase activity, the pre-immune and antisera were exchanged into 0.15M NaCl, 

5 0.01M sodium phoshate buffer, pH 7.4 (PBS) using a 100 kD cut-off membrane in order to 
remove low molecular weight chicken heparanase normally present in the serum. Aliquots of 
isolated heparanase (15 ng) were pre-incubated for 30 min with 2 ul of either pre-immune or 
anti-CTAP-m antisera before adding the 35 S-HSPG to determine heparanase activity. In the 
presence of the pre-immune sera, the isolated protein had 14.3 ± 0.1 units of heparanase activity, 

10 while in the presence of the C-terminal peptide antisera, only 0.8 ± 0.2 units of heparanase were 
detected (p < 0.001; results confirmed in a second experiment). The N-terminal peptide 
antiserum was not able to neutralize the heparanase activity. Similar results were obtained when 
the ability of the synthetic peptides to neutralize heparanase activity was examined. Heparanase 
assays conducted with 3 nM enzyme, 47 nM 35 S-HSPG substrate, and varying concentrations of 

15 peptides showed that heparanase activity was only 5% of control values in the presence of 250 
pM C-terminal peptide. By contrast, heparanase activity in the presence of 250 pM of either the 
N-terminal peptide or an unrelated peptide (PLALWAR) was 67% of control values. The ability 
of both the C-terminal peptide (SEQ ID NO: 42) or antisera from a chicken immunized with the 
C-terminal synthetic peptide to neutralize heparanase activity demonstrates conclusively that 

20 CTAP-in and NAP-2 possess heparanase activity, and suggests that the C-terminal region is 
essential for catalysis. Modeling of this domain (SEQ ID NO: 42) can be used in the 
identification of potent peptide-mimetic compounds capable of inhibiting this enzyme activity. 

Computer assisted modeling can be accomplished using programs for automated docking 
of molecules within 3D databases, as described in DesJarlais, RX., Sheridan, R.P., Seibel, G.L., 

25 Dixon, J.S., Kuntz, IX)., Venkataraghavan, R., 'Using shape complementarity as an initial screen 
in designing ligands for a receptor binding site of known three-dimensional structure 1 '; J. Med. 
Chem. 31:722-729, 1988. Also, automated de novo construction of ligands that can bind the 
catalytic site as described in Moon, J.B.. Howe, W J., '-Computer design of bioactive molecules: 
a method for receptor-based de novo ligand design 11 ; Proteins: Struct, Funct, and Genetics, 

30 11:314-328, 1981. 
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SEQUENCE LISTING 

5 (1) GENERAL INFORMATION: 

<i) APPLICANT: Hoogewerf, Arlene J. 

Ledbetter, Steven R. 

10 <ii) TITLE OP INVENTION: USE OP HEPARAN ASE TO IDENTIFY AND 

ISOLATE ANT I -HEPARANASE COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 43 

15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: The Upjohn Company, Intellectual Property Law 

(B) STREET: 301 Henrietta 

(C) CITY: Kalamazoo 

(D) STATE: MI 

20 (E) COUNTRY: USA 

(F) ZIP: 49001 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(VI) CURRENT APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: Jameson, William G. 

(B) REGISTRATION NUMBER: 27,199 

(C) REFERENCE/DOCKET NUMBER: 4731.1 CP 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: 616/385-7561 

(B) TELEFAX: 616/385-6897 

(C) TELEX: 224401 

-/ 

45 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
<B) TYPE: amino acid 
50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



Asn Leu Ala Lys Gly V/a Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 
60 1 5 10 15 

Glu Leu Arg Cys Met Cys lie Lys Thr Thr Ser Gly lie His Pro Lys 
20 25 30 

65 Asn He Gin Ser Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin 

35 40 45 

Val Glu Val He Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp 
50 55 60 
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Pro asp Ala Pro Arg lie Lys Lys II© Val Gin Lys Lys Leu Ala Gly 
65 70 75 80 

Asp Glu Ser Ala Asp 
5 85 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 255 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA ACTCCGCTGC 60 
ATGTGTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA TCCAAAGTTT GGAAGTGATC 120 
GGGAAAGGAA CCCATTGCAA CCAAGTCGAA GTGATAGCCA CACTGAAGGA T GGG AGG AAA 180 
25 ATCTGCCTGG ACCCAGATGC TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT 240 
GATGAATCTG CTGAT 

<2) INFORMATION FOR SEQ ID NO: 3: 



20 



30 



35 



65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

40 Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 

15 10 15 

Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys He 
20 25 30 

45 

Lys Thr Thr Ser Gly He His Pro Lys Asn He Gin Ser Leu Glu Val 
35 40 45 

He Gly Lys Gly Thr His Cys Asn Gin Val Glu Val He Ala Thr Leu 
50 50 55 60 

Lys Asp Gly Arg Lys He Cys Leu Asp Pro Asp Ala Pro Arg He Lys 
65 70 75 80 

55 Lys He Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp 

85 90 

(2) INFORMATION FOR SEQ ID NO:4: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



255 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 



10 
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TCCTCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA AAGTCTAGAC 60 

AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA CAACCTCTGG AATTCATCCC 120 

AAAAACATCC AAAGTTTGGA AGTGATCGGG AAAGGAACCC ATTGCAACCA AGTCGAAGTG 180 

ATAGCCACAC TGAAGGATGG GAGGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG 240 

AAAATTGTAC AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 282 
(2) INFORMATION FOR SEQ ID NO: 5: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys 
15 10 15 

Met Cys lie Lys Thr Thr ser Gly He His Pro Lys Asn He Gin Ser 
20 25 30 

Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin Val Glu Val He 
30 35 40 45 

Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp Pro Asp Ala Pro 
50 55 60 

35 Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala 

65 ~ 70 75 80 



40 



60 



Asp 



(2) INFORMATION FOR SEQ ID NO: 6 J 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 Z base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGCAAAGAGG AAAGTCTAGA CAGTGACTTG TATGCTGAAC TCCGCTGCAT GTGTATAAAG 60 
55 ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTTGG AAGTGATCGG GAAAGGAACC 120 
CATTGCAACC AAGTCGAAGT GATAGCCACA CTGAAGGATG GGAGGAJ\A \T CTGCCTGGAC 180 
CCAGATGCTC CCAGAATCAA GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT 240 
GAT 

(2) INFORMATION FOR SEQ ID NO: 7: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



243 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu Leu Arg Cys Met Cys lie Lys Thr Thr Ser Gly He His Pro Lys 
5 1 5 10 15 

Asn He Gin Ser Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin 
20 25 30 

10 Val Glu Val He Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp 

35 40 45 



15 



20 



30 



40 



45 



55 



pro Asp Ala Pro Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly 
50 55 60 

Asp Glu Ser Ala Asp 
65 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA CATCCAAAGT 60 

TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG AAGTGATAGC CACACTGAAG 120 

35 GATGGGAGGA AAATCTGCCT GGACCCAGAT GCTCCCAGAA TCAAGAAAAT TG T ACAG AAA 180 

AAATTGGCAG GTGATGAATC TGCTGAT 207 
(2) INFORMATION FOR SEQ ID NO: 9: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



50 Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 

15 10 15 



Glu Leu Arg 
(2) INFORMATION FOR SEQ ID NO : 10 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
60 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 
1 5 10 15 
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(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



.10 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp ser Asp Leu Tyr Ala 
15 10 15 

15 Glu 



2) INFORMATION FOR SEQ ID NO: 12: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: peptide 



20 



(i) 



25 



<ii) 



30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe 
1 5 10 15 

35 Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu 

20 25 30 

Ala Glu Glu Asp Gly Asp Leu Gin Cys Leu Cys Val Lys Thr Thr Ser 
35 40 45 

40 

Gin Val Arg Pro Arg His He Thr Ser Leu Glu val He Lys Ala Gly 
50 55 60 

Pro His Cys Pro Thr Ala Gin Leu lie Ala Thr Leu Lys Asn Gly Arg 
45 65 70 75 80 

Lys He Cys Leu Asp Leu Gin Ala Pro Leu Tyr Lys Lys He He Lys 
85 90 95 

50 Lys Leu Leu Glu Ser 

100 

(2) INFORMATION FOR SEQ ID NO: 13: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
65 CCGCAGCATG AGCTCCGCAG CCGGGTTCTG CGCCTCACGC CCCGGGCTGC TGTTCCTGGG 60 
GTTGCTGCTC CTGCCACTTG TGGTCGCCTT CGCCAGCGCT GAAGCTGAAG AAGATGGGGA 120 
CCTGCAGTGC CTGTGTGTGA AGACCACCTC CCAGGTCCGT CCCAGGCACA TCACCAGCCT 180 



10 
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GGAGGTGATC AAGGCCGGAC CCCACTGCCC CACTGCCCAA CTGATAGCCA CGCTGAAGAA 240 

TGGAAGGAAA ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 300 

TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC ATACTTCTTT 360 

TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT TTGTGTTATC CTTATGATTT 420 

TAAATAAACA AAATAAATC 439 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



45 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 



25 Met Asn Gin Thr Ala He Leu He Cys Cys Leu He Phe Leu Thr Leu 

15 10 15 

Ser Gly He Gin Gly Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys 
20 " 25 30 

30 

He Ser He Ser Asn Gin Pro Val Asn Pro Val Asn Pro Arg Ser Leu 
35 40 45 

Glu Lys Leu Glu He He Pro Ala Ser Gin Phe Cys Pro Arg Val Glu 
35 50 55 60 

He He Ala Thr Met Lys Lye Lys Gly Glu Lys Arg Cys Leu Asn Pro 
65 70 75 80 

40 Glu Ser Lys Ala He Lys Asn Leu Leu Lys Ala Val ser Lys Glu Met 

85 90 95 



Ser Lys Arg Ser Pro 
100 

(2) INFORMATION FOR SEQ ID NO: 15; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 
50 (B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

60 AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AG AAAAAC TT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

65 CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
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TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA 650 
(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



45 



65 



<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: t:T5Q ID NO: 16: 

25 Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 

1 5 10 15 

Ara val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala 
20 25 30 

30 

Ala Gly Ala Ser Val Ala Thr Glu Leu Arg Cys Gin Cys Leu Gin Thr 
35 40 45 

Leu Gin Gly He His Pro Lys Asn He Gin Ser Val Asn val Lys Ser 
35 50 55 60 

Pro Gly Pro His Cys Ala Gin Thr Glu Val He Ala Thr Leu Lys Asn 
65 70 75 80 

40 Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro He Val Lys Lys He 

85 90 95 



He Glu Lys Met Leu Asn Ser Asp Lys Ser Asn 
100 105 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1050 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC CCCATGGCCC 60 

60 GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT GCGAGTGGCA CTGCTGCTCC 120 

TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG CAGGAGCGTC CGTGGCCACT GAACTGCGCT 180 

GCCAGTGCTT GCAGACCCTG CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAACGTGA 240 

AGTCCCCCGG ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 300 

AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG ATGCTGAACA 360 
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GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC TGGTGGCTGT TCCTGAAGGA 420 

GGCCCTGCCC TTATAGGAAC AGAAGAGGAA AGAGAGACAC AGCTGCAGAG GCCACCTGGA 480 

TTGTGCCTAA TGTGTTTGAG CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT 540 

TAGTTTTGAA GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 600 

CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC AAGTTAGTTC 660 

AATCTGGATT CATATTTAAT TTGAAGGTAG AATGTTTTCA AATGTTCTCC AGTCATTATG 720 

TTAATATTTC TGAGGAGCCT GCAACATGCC AGCCACTGTG ATAGAGGCTG GCGGATCCAA 780 

15 GCAAATGGCC AATGAGATCA TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA 840 

ACTGTTTAGA TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 900 

TTTCTCATGT TGAAACTTTA AGAACTAAAA TGXTCTAAAT ATCCCTTGGA CATTTTATGT 960 

CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT ACAGTGTTTC TGGCTTAGAA 1020 

CAAAGGGGCT TAATTATTGA TGTTTTCGGA 1050 
25 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino aoids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



35 



50 



(ii) MOLECDLE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 
40 1 5 10 3.5 

Arq Val Ala Leu Leu Leu Leu Leu Leu val Ala Ala Ser Arg Arg Ala 
20 25 30 

45 Ala Gly Ala Pro Lys Ala Thr Glu Lys Arg Cys Gin Cys Lys Gin Thr 

35 * 40 4 5 

Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val Lys Val Lys Ser 
50 55 60 

Pro Glv Pro His Cys Ala Gin Thr Glu Val He Ala Thr Leu Lys Asn 
65 70 75 80 

Glv Gin Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys He 
55 85 90 95 

He Glu Lys Met Leu Lys 
100 

60 (2) INFORMATION FOR, SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1081 base pairs 

(B) TYPE: nucleic acid 
65 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 





CTCTCCTCCT 


CGCACAGCCG 


CTCGAACCGC 


CTGCTGAGCC 


CCATGGCCCG CGCCACGCTC 


60 


5 


TCCGCCGCCC 


CCAGCAATCC 


CCGGCTCCTG 


CGGGTGGCGC 


TGCTGCTCCT GCTCCTGGTG 


120 




GCCGCCAGCC 


GGCGCGCAGC 


AGGAGCGCCC 


CTGGCCACTG 


AACTGCGCTG CCAGTGCTTG 


180 


in 


CAGACCCTGC 


AGGGAATTCA 


CCTCAAGAAC 


ATCCAAAGTG 


TGAAGGTGAA GTCCCCCGGA 


240 


CCCCACTGCG 


CCCAAACCGA 


AGTCATAGCC 


ACACTCAAGA 


ATGGGCAGAA AGCTTGTCTC 


300 




AACCCCGCAT 


CGCCCATGGT 


TAAGAAAATC 


ATCGAAAAGA 


TGCTGAAAAA TGGCAAATCC 


360 


15 


AACTGACCAG 


AAGGAAGGAG 


GAAGCTTATT 


GGTGGCTGTT 


CCTGAAGGAG GCCCTGCCCT 


420 




TACAGGAACA 


GAAGAGGAAA 


GAGAGACACA 


GCTGCAGAGG 


CCACCTGGAT TGCGCCTAAT 


480 


20 


GTGTTTGAGC 


AT C ACTTAGG 


AGAAGTCTTC 


TATTTATTTA 


TTTATTTATT TATTTGTTTG 


540 


TTTTAGAAGA 


TTCTATGTTA 


ATATTTTATG TGTAAAATAA GGTTATGATT GAATCTACTT 


600 




GCACACTCTC 


CCATTATATT 


TATTGTTTAT TTTAGGTCAA ACCCAAGTTA GTTCAATCCT 


660 


25 


GATTCATATT 


TAATTTGAAG 


ATAGAAGGTT 


TGCAGATATT 


CTCTAGTCAT TTGTTAATAT 


720 




TTCTTCGTGA 


TGACATATCA 


CATGTCAGCC 


ACTGTGATAG 


AGGCTGAGGA ATCCAAGAAA 


780 


30 


ATGGCCAGTG 


AGATCAATGT 


GACGGCAGGG 


AAATGTATGT 


GTGTCTATTT TGTAACTGTA 


840 


AAGATGAATG 


TCAGTTGTTA 


TTTATTGAAA 


TGATTTCACA 


GTGTGTGGTC AACATTTCTC 


900 




ATGTTGAAGC 


TTTAAGAACT 


AAAATGTTCT 


AAATATCCCT 


TGGACATTTT ATGTCTTTCT 


960 


35 


TGTAAGGCAT 


ACTGCCTTGT 


TTAATGTTAA 


TTATGCAGTG 


TTTCCCTCTG TGTTAGAGCA 


1020 




GAGAGGTTTC 


GATATTTATT 


GATGTTTTCA 


CAAAGAACAG 


GAAAATAAAA TATTTAAAAA 


1080 


40 


T 

<2) INFORMATION FOR SEQ ID NO: 20 






1081 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

55 Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 

1 5 10 15 

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala 
20 25 30 

60 

Ala Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gin Cys Leu Gin Thr 
35 40 45 

Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val Asn val Arg Ser 
65 50 55 60 

Pro Gly Pro His Cys Ala Gin Thr Glu Val lie Ala Thr Leu Lys Asn 
65 70 75 80 



20 



30 



40 



50 



60 



(ii) MOLECULE TYPE: peptide 



60 
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Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gin Lys lie 

85 90 95 

lie Glu Lys lie Leu Asn Lys Gly Ser Thr Asn 
5 100 105 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 
10 <A) LENGTH: 988 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTCGCACAGC TTCCCGACGC GTCTGCTGAG CCCCATGGCC CACGCCACGC TCTCCGCCGC 
CCCCAGCAAT CCCCGGCTCC TGCGGGTGGC GCTGCTGCTC CTGCTCCTGG TGGCCGCCAG 
CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC TGAACTGCGC TGCCAGTGCT TGCAGACACT 180 
25 GCAGGGAATT CACCTCAAGA ACATCCAAAG TGTGAATGTA AGGTCCCCCG GACCCCACTG 240 
CGCCCAAACC GAAGT C AT AG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC 300 
ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA CCAACTGACA 360 
GGAGAGAAGT AAGAAGCTTA TCAGCGTATC ATTGACACTT CCTGCAGGGT GGTCCCTGCC 420 
CTTACCAGAG CTGAAAATGA AAAAGAGAAC AGCAGCTTTC TAGGGAOAGG TGGAAAGGAO 480 
35 TTAATGTGTT TGACTATTTC TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG 540 
CTTGTAT TTT AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTT CATCAAACAT 600 
AGCTCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTT AAATGTGTCA TTAAACTAAT 660 
ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA TGACAGGGTG GGGAACTGGA 720 
GGGTGGGGGG ATTGAAATGC AAGCAATTAG TGGATCACTG TTAGGGTAAG GGAATGTATG 780 
45 TACACATCTA TTTTTTATAC TTTTTTTTTA AAAAAAGAAT GTCAGTTGTT ATTTATTCAA 840 
ATTATCTCAC ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATTTTATGT 900 
CTTGCTTGTA GGGCATAATG CCTTGTTTAA TGTCCATTCT GCAGCGTTTC TCTTTCCCTT 960 
GGAAAAGAGA ATTTATCATT ACTGTTAC 
(2) INFORMATION FOR SEQ ID NO: 22: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



988 



65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu lie Ser 

10 15 



1 5 
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Ala Ala Leu Cys Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu 
20 25 30 

Arg Cys Gin Cys He Lys Thr Tyr Ser Lys Pro Phe His Pro Lys Phe 
5 35 40 45 

He Lys Glu Leu Arg Val He Glu Ser Gly Pro His Cys Ala Asn Thr 
50 55 60 

10 Glu He He Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro 

65 70 75 80 

Lys Glu Asn Trp Val Gin Arg Val Val Glu Lys Phe Leu Lys Arg Ala 
85 90 95 

15 

Glu 



(2) INFORMATION FOR SEQ ID NO: 23: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATGACTTCCA AGCTGGCCGT GGCTCTCTTG GCAGCCTTCC TGATTTCTGC AGCTCTGTGT 60 

GAAGGTGCAG TTTTGCCAAG G AGTGCT AAA GAACTTAGAT GTCAGTGCAT AAAGACATAC 120 

35 TCCAAACCTT TCCACCCCAA ATTTATCAAA GAACTGAGAG TGATTGAGAG TGGACCACAC 180 

TGCGCCAACA CAGAAATTAT TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC 240 

AAGGAAAACT GGGTGCAGAG GGTTGTGGAG AAGTTTTTGA AGAGGGCTGA G 291 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



40 



50 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



55 Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu 

X 5 10 15 

Gin Thr Thr Gin Gly Val His Pro Lys Met He Ser Asn Leu Gin Val 
20 25 30 

60 

Phe Ala He Gly Pro Gin Cys Ser Lys Val Glu Val Val Ala Ser Leu 
35 40 45 

Lys Asn Gly Lys Glu He Cys Leu Asp Pro Glu Ala Pro Phe Leu Lys 
65 50 55 60 

Lys Val He Gin Lys He Leu Asp Gly Gly Asn Lys Glu Asn 
65 70 75 
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20 



30 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GTGTTGCGGG AACTGCGGTG CGTGTGTTTA CAGACCACGC AGGGAGTTCA TCCCAAAATG 60 

15 ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT CCAAGGTGGA AGTGGTAGCC 120 

TCCCTGAAGA ACGGGAAGGA AATTTGTCTT GATCCAGAAG CCCCTTTTCT AAAGAAAGTC 180 

ATCCAGAAAA TCCTCGACGG CGGCAACAAA GAAAAC 216 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



35 Met Gin Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Thr Met Ala 

1 5 10 15 

Leu Cys Asn Gin Val Leu Ser Ala Pro Leu Ala Ala Asp Thr Pro Thr 
20 25 30 

40 

Ala Cys Cys Phe Ser Tyr Thr Ser Arg Gin lie Pro Gin Asn Phe lie 
35 40 45 

Ala Asp Tyr Phe Glu Thr Ser Ser Gin Cys Ser Lys Pro Ser Val lie 
45 50 55 60 

Phe Leu Thr Lys Arg Gly Arg Gin Val Cys Ala Asp Pro Ser Glu Glu 
65 70 75 80 

50 Trp val Gin Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala 

85 90 

(2) INFORMATION FOR SEQ ID NO: 27: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

60 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
65 GAATTCAAGG CCTGTCCTGG TTTGGTCCCA ATTTACCTTT ATCATCCATA TTCACCCCCA 60 
CTGCTCTGCA GCTCCAC TG A AGCACCCCCT CTTTCCTCTG AGCCACAATG TCACACCCAG 120 
GACTCTGCCT CAGCTGGGCC TCCACTGCCC ACCCATCTAT AGATGCCTAA ATCCCGGGCA 180 



WO 95/04158 

GTTATCCAGA CACAACTAAA GTTCCATCCC 
GAAGGTCACT TCTTCCTCAT GGGGTTCTGA 
5 ACAATTACCT GTTCATACAC TCTACCTGCC 
AGATCATTAA AACCAACTAA ATCTATTTCT 
CAAAATAATA AAATAACAAA GTATGGAAAA 

10 

AGGTAGAGAC CCTTCCACAC CACTGAAATT 
TTCAGTTTGG CATTATTTAT GTAAGTATAG 
15 GGGAAATCAA GACTTCATTT TGGCAAAGCC 
TAATTTCTTT CAGATTAAAC AACTGACAAC 
TGTTTTAAAA CAAAATTAGT T CAT AAATT T 

20 

AC T AAAT AAA ATGAGGAAAA TTCAGAAAAA 
ATCTAAGCAC TATTCATTCT TTAAGGCATG 
25 TAGAGTTGGC ATGGCATATA TATATCTTTA 
TAATGTTTTA TCTTGCTTTT TTTTTAACCA 
ATTCACGTTA GAGACTTAAT AATAAAGGAT 

30 

GAAAAATTTA GCTTTGCTTA TTTTAGAGTT 
TTTGAAGATG GCTCCATATT TGGGTTGTTT 
35 TAGGTTAATG GCTACACATA TTTCTAGGCA 
TTTTTATGAT CCACAACTAG CGTTTAACAC 
AGACAAATGA CTGAAACGTG ACCTCATGCT 

40 

TTCCTCTGGG AGGACTGGGG GTTGTCTAGC 
CTTGTGGTTA TAGCAGCTGA GGAAGCAGAA 
45 AGAGTTCATG CATAGACCAA TTCTTTTTTT 
GTTGCCCAGG CTGGAGTGCA ATGGCATGAT 
TTCAAGCGAT TCTCCTGCCC TCAGCCTCCC 

50 

ACGCCTGACT ACTTTTGTAT TTTTAGTAGA 
TCTCAAACTC CTGACCTCAG GTGATCTGCA 
55 GTGTGAGCGA CCATGCCTGG CTGCATAGAC 
TAGCCTTGGG TTGACACACA CCCCTCTTCA 
ACCAGTCATG AGTTGAGAGC TGAGAGTTAG 

60 

TCCTGAGCCC CTGTGGTCAC CAGGGACCCT 
ACTACACTTA AAAATTTCCC TCCTCACCCC 
65 CCTATAAAGA GGAGAGATGG CT TC AG AC AT 
GTCCCTTCTT GGCTCTGCTG ACACTCGAGC 
AGGTCTCCAC TGCTGCCCTT GCCGTCCTCC 
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TTCCATGAAG 


CCTTCCCCAA 


CCCTCTGGTG 


240 


GCTTTCATTT 


CTTTTTCTAC 


TAAGAGTTTT 


300 


CCCATGAGAC 


CAGGGGCATC 


TCAGAAACAA 


360 


CATTATAAAA 


TGAGATATGC 


TGATTGATTG 


420 


GAAAAAAAAA 


AGCATATAAT 


CTGGCTGAGA 


480 


ATGTGTTGAA 


AAGAATAAGG 


AAAAAACTGC 


540 


TATAGGATCC 


TTAAAATGGT 


TCAAAGAAAT 


600 


ATTGAACAGA 


AACTGTAGCA 


TATTTATCAG 


660 


AACCCACTTT 


TCAACCAGTG 


ATGTTGGAAA 


720 


GTGGGTTGAC 


CAAGAAGGTA ATAAAGTCTC 


780 


GAAAAAAATA 


AGAAAATAAA 


TCACCCATGG 


840 


TATTTCCAAG 


CCTTTTAATT 


TTTTCATGCC 


900 


TACAATTCTT 


CAAATTTTAT 


AGAATTTGTA 


960 


CTGATGTTAT 


AAGCATATTT 


ATGCCACTTC 


1020 


CTTGTGGATA ATTTATCATT 


CCCTGATAGA 


1080 


ATAAATGATG 


CTGGGTCAGG 


TATCTTTATG 


1140 


CCACAGAACT 


CTTTCCAGAA 


ATGCTTTTTC 


1200 


CCTGACATAC 


TGACACCCAC 


CTCTAAAGTA 


1260 


AGCGCCCC? 0 


TCACTCCGAG 


ACTAATAAAT 


1320 


TTCTATTCCT 


CCAGCTTTCA 


TTGAGTTCCT 


13B0 


CCTCCACAGC 


ATCAGCCCAT 


TGACCCTATC 


1440 


TTACAGCTCT 


GTGGGAAGGA 


ATGGGGCTGG 


1500 




TGAGATGGAG 


TTTCACTTTT 


1560 


CTCAGCTCAC 


CACAGCCCCC 


ACCTCCTGGG 


1620 


GAGTAGCTGG 


GATTACAGGC 


ATGTGCCACC 


1680 


GATGGAGTTT 


CTCTTTCTTG 


GTCAGGTTGG 


1740 


GCCTCGGCCT CCAAAGTGTT 


GGGATTACAG 


1800 


CAGTTCTTAT 


GAGAAGGGAT 


CAACTAAGAA 


1860 


CACTCACAGG 


AGAAACCCCA 


TGAAGCTAGA 


19"0 


AGAGTAGCTC 


AGAGATGCTA 


TTCTTGGATA 


19i.O 


GAGTTGTGCA ACACTCAGCA TGACAGCATC 


2040 


CAGATTCCAT 


TTCCCCATCC 


GCCAGGGCTG 


2100 


CAGAAGGACG 


CAGGCAGCAA 


AGAGTAGTCA 


2160 


CCACATTCCA 


TCACCTGCTC 


CCAATCATGC 


2220 


TCTGCACCAT 


GGCTCTCTGC 


AACCAGGTCC 


2280 
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TCTCTGCACC ACGTGAGTCC ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT 2340 

AGACCACATC AGTCTTTTTT TGTGGCGTGA GAGGCCCCGA AGAGAAAAGA AGGAAGTTCT 2400 

5 TAAAGCGCTG CCAAACACCT TGGTCTTTTT CTTCACAACT TTTATTTTTA TCTCTAGAAG 2460 

GGGTCTTAGC CCTCCTAGTC TCCAGGTATG AGAATCTAGG CAGGGGCAGG GGAGTTACAG 2520 

TCCCTTGTAC AGATAGAAAA ACAGGGTTCA AAACGAATCA GTTTGCAAGA GGCAGAATCC 2580 

10 AGGGCTGCTT ACTTCCCAGT GGGGTCTGTT CTTCACTCTC CAGCTCACCC TAGTC TCCC A 2640 

GGAGCCCTGT CCCTTGGATG TCTTATGAGA GATGTCCAGG GCTTCTCTTG GGCTGGGGTA 2700 

15 TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTCC ATTCAGGTAT 2760 

CTGGATCACA TAGAGAAACA GAGAACCCAC TATGAAGAGT CAAGGGGAAA GAGGAATATA 2820 

GACAGAAACA AAG AG AC AT T TCTCTGCAAA ACCCCCCAAA TGCCTTGCAG TCACTTGGTC 2880 

20 

TGAGCAAGCC TGCCCTCCTC AACCACTCAG GGATCAGAAG CTGCCTGGCC TTTTCTTCTG 2940 

AGCTGTGACT TGGGCTTATT CTCTCCTTTC TCCGCAGTTG CTGCTGACAC GCCGACCGCC 3000 

25 TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA CTACTTTGAG 3060 

ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC CAGTCTTCCT GCTCACCTCT 3120- 

AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA GAAACAGGCC AGAAGGCCAT CCTGGAAAGG 3180 

30 

CCCAGCCTTC AGGAGCC TAT CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT 3240 

TGGGGCTGGA GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCCCAGG CCAGAGGAAA 3300 

35 GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCTGG AGTCACTG AG 3360 

AGAAGCTCTC TAGACGGAGA TAGGCAGGGG GCCCCTGAGA GAGGAGCAGG CCTTGAGCTG 3420 

CCCAGGACAG AGAGCAGGAT GTCAGGGCCA TGGTGGGCCC AGGATTCCCC GGCTGGATTC 3480 

40 

CCCAGTGCTT AACTCTTCCT CCCTTCTCCA CAGCTTCCTA ACCAAGAGAG GCCGGCAGGT 3540 

CTGTGCTGAC CCCAGTGAGG AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC 3600 

45 CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG GGAGGAGCAG 3660 

GAGCCTGAGC CTTGGGAACA TGCGTGTGAC CTCCACAGCT ACCTCTTCTA TGGACTGGTT 3720 

ATTGCCAAAC AGCCACACTG TGGGACTCTT CTTAACTTAA ATTTTAATTT ATTTATACTA 3780 

50 TTTAGTTTTT ATAATTTATT TTTGATTTCA CAGTGTGTTT GTGATTGTTT GCTCTGAGAG 3840 

TTCCCCCTGT CCCCTCCACC TTCCCTCACA GTGTGTCTGG TGACAACCGA GTGGCTGTCA 3900 

55 TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT GTATCAGATG 3960 

CTTTTGTTCA GGGCTGTGAT CGGCCTGGGG AAATAATAAA GATGTTCTTT TAAACGGTAA 4020 

ACCAGTATTG AGTTTGGTTT TGTTTTTCTG GCAAATCAAA ATCACTGGTT AAGAGGAATC 4080 

60 

ATAGGCAAAG ATTAGGAAGA GGTGAAATGG AGGGAAATTG GGAGAGATGG GGAGCGCTGC 4140 
GACAGAGTTA TCCACTTCAC AAAATTCTGG AACATTGAAA CTACGAATAT GTTATAACTC 4200 
65 AAATCGTAAT ATGCACGCTC TAGGAGAATT AACTACTTGA ATGGCCACCA TTAAGCAGAG 4260 
TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT AGCAATTATT TACATTGTTG 4 320 
AGGCTTACTC CTCCTACTGA GTGCTTTTTA TACATTGTTC ATTTAATCTT ACCAATGCAA 4 380 
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TAGTACAGCT TAGGTACTAT TAATACCTCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA 4440 

AAGGTTAGAC AACTTGGCTG AGGTTACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 4500 

i TGGACTTTTA TTGAACTACA G AC T ATGCTG TTAACCATTG ACCAAGTTAT TTCCCAAAGT 4560 

ATGACCCGCC TATACTCAAA TCTTACCCCA TTCTTTAACA G ATGAT ACT T TATCCATTGC 4620 

AACCACTTCC TGTCAGGATT CTGAGTTGAC ATAGAGTGTT TCAGCAGTGA TTATTTAAGC 4680 

CAATTACATC AGGATCTTTA GGTGTAGACC TGGGAACTGA TATTTTTATC AAGCTCATGA 4740 

GGTGTTCCAT AGCATGTTAA TGACTGAGAG CCACTGTCAA TAGAATTC 4788 
15 (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

20 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



10 



25 



40 



60 



(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu val Ala Ala 
30 1 5 10 15 

Phe Cys Ser Pro Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr 
20 25 30 

35 Ala Cys Cys Phe Ser Tyr Thr Ala Arg Lys Leu Pro Arg Asn Phe Val 

35 40 45 

Val Asp Tyr Tyr Glu Thr Ser Ser Leu Cys Ser Gin Pro Ala Val Val 
50 55 60 

Phe Gin Thr Lys Arg Ser Lys Gin Val Cys Ala Asp Pro Ser Glu Ser 
65 70 75 80 

Trp Val Gin Glu Tyr Val Tyr Asp Leu Glu Leu Asn 
45 85 90 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTCCCCCCCC CCCCCCCCCC CCCCGCCCGA GCACAGGACA CAGCTGGGTT CTGAAGCTTC 60 
TGAGTTCTGC AGCCTCACCT CTGAGAAAAC CTCTTTTCCA CCAATACCAT GAAGCTCTGC 120 
GTGACTGTCC TGTCTCTCCT CATGCTAGTA GCTGCCTTCT GCTCTCCAGC GCTCTCAGCA 180 
65 CCAATGGGCT CAGACCCTCC CACCGCCTGC TGCTTTTCTT ACACCGCGAG GAAGCTTCCT 240 
CGCAACTTTG TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA 300 
TTCCAAACCA AAAGAAGCAA GCAAGTCTGT GCTGATCCCA GTGAATCCTG GGTCCAGGAG 360 
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GGGCTCTGGA AACCACATGG CTTCACCTGT CCCCGAAACT ACCAGCCCTA CACCATTCCT 420 

TCTGCCCTGC TTTTGCTAGG TCACAGAGGA TCTGCTTGGT CTTGATAAGC TATGTTGTTG 480 

5 CACTTTAAAC ATTTAAATTA TACAATCATC AACCCCCAAC 520 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu lie Ala Ala Thr 
15 10 15 

Phe lie Pro Gin Gly Lys Ala Gin Pro Asp Ala lie Asn Ala Pro Val 
25 20 25 30 

Thr Cys Cys Tyr Asn Phe Thr Asn Arg Lys lie Ser Val Gin Arg Leu 
35 40 45 

30 Ala Ser Tyr Arg Arg lie Thr Ser Ser Lys cys Pro Lys Glu Ala val 

50 '55 60 

lie Phe Lys Thr lie Val Ala Lys Glu lie Cys Ala Asp Pro Lys Gin 
65 70 75 B0 

Lys Trp Val Gin Asp Ser Met Asp His Leu Asp Lys Gin Thr Gin Thr 
85 90 95 

Pro Lys Thr 



35 



40 



55 



65 



<2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 725 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC AGCATGAAAG 60 
TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC CTTCATTCCC CAAGGGCTCG 120 
CTCAGCCAGA TGCAATCAAT GCGCCAGTCA CGTGGTGT1A TAACITCAIC AATAGGAAGA 180 
60 TCTCAGTGCA GAGGCTCGCG AGCTATAGAA GAATCACCAG CAGCAAGTGT CCCAAAGAAG 240 
CTGTGATCTT CAAGACCATT GTGGCCAAGG AGATCTGTGC TGACCCOAAG CAGAAGTGGG 300 
TTCAGGATTC CATGGACCAC CTGGACAAGC AAACCCAAAC TCCGAAGACT TGAACACTCA 360 
CTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT AGCTTTCCCC AGACACCCTG 420 
TTTTATTTTA TTATAATGAA TTTTGTTTGT TGATGTGAAA CATTATGCCT TAAGTAATGT 480 



10 
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TAATTCTTAT TTAAGTTATT GATGTTTTAA GTTTATCTTT CATGGTACTA GTGTTTTTTA 540 
GATACAGAGA CTTGGGGAAA TTGCTTTTCC TCTTGAACCA CAGTTCTACC CCTGGGATGT 600 
TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT CCAATGCATT 660 
GCTAAAATAT TATTGTGGAA ATGAATATTT TGTAACTATT ACACCAAATA AATATATTTT 720 
TGTAC 

(2) INFORMATION FOR SEQ ID NO; 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



30 



45 



65 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



25 Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala 

15 10 15 



Phe Ser Pro Gin Gly Leu Ala Gin Pro Val Gly He Asn Thr Ser Thr 
20 25 30 

Thr Cys Cys Tyr Arg Phe He Asn Lys Lys He Pro Lys Gin Arg Leu 
35 40 45 



Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro Arg Glu Ala Val 
35 50 ~ 55 60 

He Phe Lys Thr Lys Leu Asp Lys Glu He Cys Ala Asp Pro Thr Gin 
65 70 75 B0 

40 Lys Trp Val Gin Asp Phe Met Lys His Leu Asp Lys Lys Thr Gin Thr 

85 90 95 



Pro Lys Leu 



(2) INFORMATION FOR SEQ ID NO; 35: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 



725 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AGCAGAGGGG CTGAGACCAA ACCAGAAACC TCCAATTCTC ATGTGGAAGC CCATGCCCTC 60 

60 ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG CTGCTCACAG CAGCTGCTTT 120 

CAGCCCCCAG GGGCTTGCTC AGCCAGTTGG GATTAATACT TCAACTACCT GCTGCTACAG 180 

ATTTATCAAT AAGAAAATCC CTAAGCAGAG GCTGGAGAGC TACAGAAGGA CCACCAGTAG 240 

CCACTGTCCC CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 300 

CCCCACACAG AAGTGGGTCC AGGACTTTAT GAAGCACCTG GACAAGAAAA CCCAAACTCC 360 
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AAAGCTTTGA ACATTCATGA CTG AAC T AAA AACAAGCCAT GACTTGAGAA ACAAATAATT 420 

TGTATACCCT GTCCTTTCTC AGAGTGGTTC TG AG ATT ATT TTAATCTAAT TCTAAZXSAAT 480 

I ATGAGCTTTA TGTAATAATG TGAATCATGG TTTTTCTTAG TAGATTTTAA AAGTTATTAA 540 

TATTTTAATT TAATCTTCCA TGGATTTTGG TGGGTTTTGA ACATAAAGCC TTGGATGTAT 600 

ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTCCTCC CTTCTCTACC TCATGGGGGT 660 

ATTGTATAAG TCCTTGCAAG AATCAGTGCA AAGATTTGCT TTAATTGTTA AGATATGATG 720 

TCCCTATGGA AGCATATTGT TATTATATAA TTACATATTT GCATATGTAT GACTCCCAAA 780 

15 TTTTCACATA AAATAGATTT TTGTAAAAAA 810 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 <ii) MOLECULE TYPE: peptide 



10 



30 



45 



50 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Val Ser Ala Ala Arg Leu Ala Val He Leu He Ala Thr Ala 
15 10 15 



Leu Cys Ala Pro Ala Ser Ala ser Pro Tyr ser Ser Asp Thr Thr Pro 
35 20 25 30 

Cys Cys Phe Ala Tyr He Ala Arg Pro Leu Pro Arg Ala His He Lys 
35 40 45 

40 Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val Phe 

50 55 60 



Val Thr Arg Lys Asn Arg Gin Val Cys Ala Asn Pro Glu Lys Lys Trp 
65 70 75 80 

Val Arg Glu Tyr He Asn Ser Leu Glu Met Ser 
85 90 

(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37; 

CCTCCGACAG CCTCTCCACA GGTACCATGA AGGTCTCCGC GGCACGCCTC GCTGTCATCC 60 

TCATTGCTAC TGCCCTCTGC GCTCCTGCAT CTGCCTCCCC ATATTCCTCG GACACCACAC 120 

65 CCTGCTGCTT TGCCTACATT GCCCGCCCAC TGCCCCGTGC CCACATCAAG GAGTATTTCT 180 

ACACCAGTGG CAAGTGCTCC AACCCAGCAG TCGTCTTTGT CACCCGAAAG AACCGCCAAG 240 

TGTGTGCCAA CCCAGAGAAG AAATGGGTTC GGGAGTACAT CAACTCTTTG GAGATGAGCT 300 
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AGGATGGAGA GTCCTTGAAC CTGAACTTAC ACAAATTTGC CTGTTTCTGC TTGCTCTTGT 360 

CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC GCTCCTTGAA GGGCCCAGAT 420 

TCTGACCACG ACGAGCAGCA GTTACAAAAA CCTTCCCCAG GCTGGACGTG GTGGCTCAGC 480 

CTTGTAATCC CAGCACTTTG GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG 540 

AGACAGCCTG GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 600 

GCGTGGTAGC GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT 660 

GAACCCGGGA GCGGAGCTTG CAGTGAGCCG AGATCGCGCC ACTGCACTCC AGCCTGGGCG 720 

15 ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA AAAAAAAAAA AAAAAATACA AAAATTAGCC 780 

GCGTGGTGGC CCACGCCTGT AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT 840 

GAACCCAGGA GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 900 

GACAAAGTGA GACTCCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA 960 

AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG 1020 

25 CCTTGGCTTT GCAAGGGCTC TGTGACAAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG 1080 

AAGGGAGGAA CACTGCACTC TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG 1140 

GCAAACATGA AAAATCGGGG 1160 

30 

(2) INFORMATION FOR SBQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



40 



50 



<ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 



45 Met Arg lie Ser Ala Thr Leu Leu Cys Leu Leu Leu He Ala Ala Ala 

15 10 15 



Phe Ser He Gin Val Trp Ala Gin Pro Asp Gly Pro Asn Ala Ser Thr 
20 25 30 

Cys Cys Tyr val Lys Lys Gin Lys He Pro Lys Arg Asn Leu Lys Ser 
35 40 45 



Tyr Arg Arg He Thr Ser Ser Arg Cys Pro Trp Glu Ala Val He Phe 
55 50 55 60 

Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gin Lys Trp 
65 70 75 80 

60 Val Glu Glu Ala He Ala Tyr Leu Asp Met Lys Thr Pro Thr Pro Lys 

85 90 95 



Pro 



65 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 
5 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) r APOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ACTGAAGCCA GCTCTCTCAC TCTCTTTCTC CACCATGAGG ATCTCTGCCA CGCTTCTGTG 60 

15 CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG GCCCAACCAG ATGGGCCCAA 120 

TGCATCCACA TGCTGCTATG TCAAGAAACi AAGATCCCC AAGAGGAATC TCAAGAGCTA 180 

— CAGAAGGATC ACCAGTAGTC GGTGTCCCTG GGAAGCTGTT ATCTTCAAGA CAAAGAAGGG 240 

20 

CATGGAAGTC TGTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 300 

CATGAAAACC CCAACTCCAA AGCCTTGAAG AAATGTGCCT GAACAGAAAC CAACCTAGGA 360 

25 GCCAAGAAGC AAAAATTCCT CACCGCTGTT CTTTCTGAGA ACTGTTGATG AAATGTGTTG 420 

ATCACGGTCC TAAGGGATAG GAGCTGTCTG TAGGAATGTG AAACAGTCAC GCCTAAGGAA 480 

TGGTCTTTAA GTTATTAATA TTTTTATTTA ATTAGCCATG TACTTTGGTG TGATTTGAAT 540 

30 

GTAAAGCTCT GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC 593 
(2) INFORMATION FOR SEQ ID NO: 40: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: peptide 



45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Asp Ser Val Ser lie Phe lie Thr Cys Cys Phe Asn Val lie Asn Arg 
15 10 15 

50 Lys lie Pro lie Gin Arg Leu Glu Ser Tyr Thr Arg lie Thr Asn lie 

20 25 30 



55 



Gin Cys Pro Lys Glu Ala val lie Phe Lys Thr Gly Lys Glu Val Cys 
35 40 45 

Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His Lys Asp 
50 55 60 



Gin lie Phe Gin Asn Leu Lys Pro 
60 65 70 



(2) INFORMATION FOR SEQ ID NO: 41: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; peptide 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Cys 
15 10 15 

10 (2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Cys Asn Gin Val Glu Val lie Ala Thr Leu Lys Asp Gly Arg Lys lie 
25 1 5 10 15 

Cys Leu Asp Pro Asp Ala Pro Arg lie Lys Lys lie val Gin Lys Lys 
20 25 30 

30 (2) INFORMATION FOR SEQ ID NO: 43: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA GGAAAATCTG CCTGGACCCA 60 
GATGCTCCCA GAATCAAGAA AATTGTACAG AAAAAA 96 

45 
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CLAIMS 

We claim: 

1. A method of screening for AHA compounds comprising the steps of: 

a) contacting a compound with radiolabeled heparin/heparan sulfate and 
5 heparanase; 

b) maintaining the compounds in contact with the radiolabeled heparin/heparan 
sulfate and heparanase for a time and under such conditions sufficient to allow 
inhibition of heparanase activity; 

c) detecting inhibition of heparanase activity (a compound that gives 50% 
10 inhibition at a concentration of 1 uM or less); and 

d) selecting compounds that inhibit heparanase activity. 

2. A method according to claim 1 wherein oie heparanase is recombinant. 

15 3. A heparanase having an isoelectric point of less than 5.5 and possessing activity greater 
than 20 units heparanase activity per pg protein. 



4. A heparanase according to Claim 3, having an isoelectric point of about 4.8 - 5.1. 

20 5. A heparanase purified to apparent homogeneity, as in claim 3 V prepared in the presence of 
reducing conditions and activated with transglutaminase, having an amino acid sequence selected 
from the group consisting of SEQ. ID NO. 1, SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 
7. 



25 6. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conciitiorjs, fiavftg an amino acid sequence of §gQ t H> NQ; 1. 



7. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conditions and activated with transglutaminase, having an amino acid sequence of 
30 SEQ. ID NO: 3. 



8v A heparanase according to Claim 4, purified to apparent homogeneity, prepared in the 
presence of reducing conditions and activated with transglutaminase. 

35 9. A heparanase, as in claim 3, prepared by recombinant means, activated ,vith 

PWWglWMBUWK and Wm m Main 0 fttitf SfllUrace selectee) rrqm, thf? grqup consisting of SEQ. 
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ro NO: l t SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 7. 
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10. A heparan ase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 1. 

11. A heparanase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 3. 

12. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence 
of reducing conditions and activated with transglutaminase, and having an amino acid sequence 
selected from the group consisting of SEQ. ID. NO: 12, SEQ. ID. NO: 14; SEQ. ID. NO: 16, 
SEQ. ID. NO: 18, SEQ. ID. NO: 20, SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 



13. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence 
of reducing conditions and activated with transglutaminase, and having an amino acid sequence 
selected from the group consisting of SEQ. ID. NO: 26, SEQ. ID. NO: 28, SEQ. ID. NO: 30, 
SEQ. ID. NO: 32; SEQ. ID. NO: 34, SEQ. ID. NO: 36. SEQ. ID. NO: 38 and SEQ.ID. NO: 40. 

14. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID. NO: 12, SEQ. ID. NO: 14; SEQ. ID. NO: 16, SEQ. ID. NO: 18, SEQ. ID. NO: 20, 
SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 



15. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID. NO: 26, SEQ. ID. NO: 28, SEQ. ID. NO: 30, SEQ. ID. NO: 32; SEQ, ID. NO: 34, 

SEQ. ID. NO: 36, SEQ. ID. NO: 38 and SEQ. ID. NO: 40. 

16. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID NO: 1, SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 7. 



17. A peptide having an amino acid sequence of SEQ. ID. NO: 42. 
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